1 CGAGTGCGCT GCTGAGTCCT GTAGATAAAG CCGCCAACCC CGGGGACTGG
51 TGTCTCCTGA GTGACCGTGC AGCCGTGGGC GCCATAGAAA GCAGAGAAGG
101 CAGTGAACTT CGACCACTTC CAGATCCTTC GGGCCATTGG GAAGGGCAGC
151 TTTGGCAAGG TGTGCATTGT GCAGAAGCGG GACACGGAGA AGATGTACGC
201 CATGAAGTAC ATGAACAAGC AGCAGTGCAT CGAGCGCGAC GAGGTCCGCA
251 ACGTCTTCCG GGAGCTGGAG ATCCTGCAGG AGATCGAGCA CGTCTTCCTG
301 GTGAACCTCT GGTACTCCTT CCAGGACGAG GAGGACATGT TCATGGTCGT
351 GGACCTGCTA CTGGGCGGGG ACCTGCGCTA CCACCTGCAG CAGAACGTGC
401 AGTTCTCCGA GGACACGGTG AGGCTGTACA TCTGCGAGAT GGCACTGGCT
451 CTGGACTACC TGCGCGGCCA GCACATCATC CACAGAGATG TCAAGCCTGA
501 CAACATTCTC CTGGATGAGA GAGGACATGC ACACCTGACC GACTTCAACA
551 TTGCCACCAT CATCAAGGAC GGGGAGCGGG CGACGGCATT AGCAGGCACC
601 AAGCCGTACA TGGCTCCGGA GATCTTCCAC TCTTTTGTCA ACGGCGGGAC
651 CGGCTACTCC TTCGAGGTGG ACTGGTGGTC GGTGGGGGTG ATGGCCTATG
701 AGCTGCTGCG AGGATGGAGG CCCTATGACA TCCACTCCAG CAACGCCGTG
751 GAGTCCCTGG TGCAGCTGTT CAGCACCGTG AGCGTCCAGT ATGTCCCCAC
801 GTGGTCCAAG GAGATGGTGG CCTTGCTGCG GAAGCTCCTC ACTGTGAACC
851 CCGAGCACCG GCTCTCCAGC CTCCAGGACG TGCAGGCAGC CCCGGCGCTG
901 GCCGGCGTGC TGTGGGACCA CCTGAGCGAG AAGAGGGTGG AGCCGGGCTT
951 CGTGCCCAAC AAAGGCCGTC TGCACTGCGA CCCCACCTTT GAGCTGGAGG
1001 AGATGATCCT GGAGTCCAGG CCCCTGCACA AGAAGAAGAA GCGCCTGGCC
1051 AAGAACAAGT CCCGGGACAA CAGCAGGGAC AGCTCCCAGT CCGAGAATGA
1101 CTATCTTCAA GACTGCCTCG ATGCCATCCA GCAAGACTTC GTGATTTTTA
1151 ACAGAGAAAA GCTGAAGAGG AGCCAGGACC TCCCGAGGGA GCCTCTCCCC
1201 GCCCCTGAGT CCAGGGATGC TGCGGAGCCT GTGGAGGACG AGGCGGAACG
1251 CTCCGCCCTG CCCATGTGCG GCCCCATTTG CCCCTCGGCC GGGAGCGGCT
1301 AGGCCGGGAC GCCCGTGGTC CTCACCCCTT GAGCTGCTTT GGAGACTCGG
1351 CTGCCAGAGG GAGGGCCATG GGCCGAGGCC TGGCATTCAC GTTCCCACCC
1401 AGCCTGGCTG GCGGTGCCCA CAGTGCCCCG GACACATTTC ACACCTCAGG
1451 CTCGTGGTGG TGCAGGGGAC AAGAGGCTGT GGGTGCAGGG GACACCTGTG
1501 GAGGGCATTT CCCGTGGGCC CCCGAGACCC GCCTAGATGG AGGAAGCGCT
1551 GCTGGGCGCC CTCTTACCGC TCACGGGGAG CTGGGGCCAT GGATGGGACA
1601 GGAGTCTTTG TCCCTGCTCA GCCCGGAGGC TGTGCACGGC CCTCGTCACA
1651 AGGTGACCCT TGCAGCACAG GCCGCGGGTG CCCCAGGCTC GGCTCAGTTC
1701 TTGGAGGTCA AGGGCATGGG TTGGGGTAGT GGGTGGGGAG GTGAATGTTT
1751 TCTAGAGATT CAAACTGCTC CAGCAATTTC TGTATAGTTT TCACCTCTGA
1801 GAATTACAAT GTGAGAACCG CACAAAAAAA AAAAAAAAAA
1851 AAAAAAAAAA
(SEQ ID NO: 1)

FEATURES:

5'UTR: 1 - 192

Start Codon: 193

Stop Codon: 1300

3'UTR: 1303

Homologous proteins:

Top 10 BLAST Hits:



Sequences producing significant alignments:                                  Score   E
                                                                             (bits)  value
CRA|103000001515936 /altid=gi|10946600 /def=ref|NP_067277.1| hy...           760     0.0
CRA|120000042903164 /altid=gi|13358640 /def=dbj|BAB33045.1| (AB...           733     0.0
CRA|87000000001314 /altid=gi|8923754 /def=ref|NP_060871.1| gene...           556     e-157
CRA|87000000001426 /altid=gi|7161864 /def=emb|CAB76566.1| (AD25...           554     e-156
CRA|108000024647823 /altid=gi|12730486 /def=ref|XP_003392.2| ge...           423     e-117
CRA|18000005184360 /altid=gi|7505957 /def=pir||T23688 hypotheti...           335     1e-90
CRA|18000005004115 /altid=gi|1730069 /def=sp|P54644|KRAC_DICDI ...           217     4e-55
CRA|18000004912236 /altid=gi|464395 /def=sp|P28178|PK2_DICDI PR...           203     8e-51
CRA|18000004910302 /altid=gi|1170689 /def=sp|P42818|KPK1_ARATH ...           202     1e-50
CRA|89000000197925 /altid=gi|7295638 /def=gb|AAF50945.1| (AE003...           201     2e-50
EST:

Sequences producing significant alignments:
gi|13032240 /dataset=dbest /taxon=960...
gi|6588496 /dataset=dbest /taxon=9606 ...
gi|883123 /dataset=dbest /taxon=9606 /...
gi|946492 /dataset=dbest /taxon=9606 /...

EXPRESSION INFORMATION FOR MODULATORY USE: 

gi 13032240 prostate 

gi 6588496 /lung 

gi 883123 /whole brain 

gi 946492 / Adult brain 

Tissue Expression 
whole brain 
1 MYAMKYMNKQ QCIERDEVRN VFRELEILQE IEHVFLVNLW YSFQDEEDMF

51 MVVDLLLGGD LRYHLQQNVQ FSEDTVRLYI CEMALALDYL RGQHIIHRDV

101 KPDNILLDER GHAHLTDFNI ATIIKDGERA TALAGTKPYM APEIFHSFVN 

151 GGTGYSFEVD WWSVGVMAYE LLRGWRPYDI HSSNAVESLV QLFSTVSVQY 

201 VPTWSKEMVA LLRKLLTVNP EHRLSSLQDV QAAPALAGVL WDHLSEKRVE 

251 PGFVPNKGRL HCDPTFELEE MILESRPLHK KKKRLAKNKS RDNSRDSSQS 

301 ENDYLQDCLD AIQQDFVIFN REKLKRSQDL PREPLPAPES RDAAEPVEDE 
351 AERSALPMCG PICPSAGSG 
(SEQ ID NO: 2) 
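
The FEATURES coordinates given for SEQ ID NO: 1 can be cross-checked against SEQ ID NO: 2 with a few lines of Python. This is only an illustrative sketch, not part of the original listing: it assumes the standard genetic code and 1-based coordinates, `translate` is a hypothetical helper, and the 32-nt fragment is positions 191-222 of SEQ ID NO: 1 as listed.

```python
# Illustrative sketch; the helper name and the fragment choice are assumptions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Standard genetic code, built from the conventional TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt):
    """Translate a nucleotide string codon by codon, ignoring a trailing partial codon."""
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


START_CODON = 193   # first base of the start codon (from FEATURES)
STOP_CODON = 1300   # first base of the stop codon (from FEATURES)

# Number of sense codons between the start codon and the stop codon.
codons = (STOP_CODON - START_CODON) // 3

# Positions 191-222 of SEQ ID NO: 1; the start codon begins at offset 2.
FRAGMENT_START = 191
fragment = "AGATGTACGCCATGAAGTACATGAACAAGCAG"
n_terminus = translate(fragment[START_CODON - FRAGMENT_START:])

print(codons)
print(n_terminus)
```

Under these assumptions the codon count equals the 369 residues listed for SEQ ID NO: 2, and the translated fragment reproduces its first ten residues.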

FEATURES: 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION

N-glycosylation site 

288-291 NKSR 



[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches: 2 

1 75-77 TVR 

2 245-247 SEK 



[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 4 

1 42-45 SFQD 

2 226-229 SLQD 

3 298-301 SQSE 

4 300-303 SEND 



[4] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

152-157 GTGYSF 



[5] PDOC00100 PS00108 PROTEIN_KINASE_ST 

Serine/Threonine protein kinases active-site signature 

95-107 IIHRDVKPDNILL 
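
The site coordinates listed above can be reproduced by pattern matching. The sketch below is illustrative only: it writes the PROSITE patterns PS00001 (N-{P}-[ST]-{P}) and PS00006 ([ST]-x(2)-[DE]) as Python regular expressions, uses a hypothetical helper `scan`, and runs on residues 281-303 of SEQ ID NO: 2 rather than the full sequence.

```python
import re

# PROSITE patterns rewritten as regexes; the lookahead keeps overlapping hits.
PATTERNS = {
    "ASN_GLYCOSYLATION": r"(?=(N[^P][ST][^P]))",  # PS00001: N-{P}-[ST]-{P}
    "CK2_PHOSPHO_SITE": r"(?=([ST]..[DE]))",      # PS00006: [ST]-x(2)-[DE]
}


def scan(seq, first_residue, regex):
    """Return (begin, end, motif) tuples in 1-based protein coordinates."""
    hits = []
    for match in re.finditer(regex, seq):
        motif = match.group(1)
        begin = first_residue + match.start()
        hits.append((begin, begin + len(motif) - 1, motif))
    return hits


# Residues 281-303 of SEQ ID NO: 2 (assumed fragment for the example).
fragment = "KKKRLAKNKSRDNSRDSSQSEND"

glyco = scan(fragment, 281, PATTERNS["ASN_GLYCOSYLATION"])
ck2 = scan(fragment, 281, PATTERNS["CK2_PHOSPHO_SITE"])

print(glyco)
print(ck2)
```

On this fragment the scan finds the N-glycosylation site at 288-291 and the two overlapping casein kinase II sites at 298-301 and 300-303, in agreement with entries [1] and [3].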

Membrane spanning structure and domains: 
Helix Begin End Score Certainty
1 185 205 0.727 Putative 
BLAST Alignment to Top Hit:

>CRA|103000001515936 /altid=gi|10946600 /def=ref|NP_067277.1|
hypothetical serine/threonine protein kinase [Mus
musculus] /org=Mus musculus /taxon=10090 /dataset=nraa
/length=488
Length = 488

Score = 760 bits (1942), Expect = 0.0

Identities = 371/399 (92%), Positives = 381/399 (94%), Gaps = 1/399 (0%)
Frame = +1

Query: 103  VNFDHFQILRAIGKGSFGKVCIVQKRDTEKMYAMKYMNKQQCIERDEVRNVFRELEILQE 282
            VNFDHFQILRAIGKGSFGKVCIVQKRDTEKMYAMKYMNKQQCIERDEVRNVFRELEILQE
Sbjct: 89   VNFDHFQILRAIGKGSFGKVCIVQKRDTEKMYAMKYMNKQQCIERDEVRNVFRELEILQE 148

Query: 283  IEHVFLVNLWYSFQDEEDMFMVVDLLLGGDLRYHLQQNVQFSEDTVRLYICEMALALDYL 462
            IEHVFLVNLWYSFQDEEDMFMVVDLLLGGDLRYHLQQNVQFSEDTVRLYICEMALALDYL
Sbjct: 149  IEHVFLVNLWYSFQDEEDMFMVVDLLLGGDLRYHLQQNVQFSEDTVRLYICEMALALDYL 208

Query: 463  RGQHIIHRDVKPDNILLDERGHAHLTDFNIATIIKDGERATALAGTKPYMAPEIFHSFVN 642
            R QHIIHRDVKPDNILLDE+GHAHLTDFNIATIIKDGERATALAGTKPYMAPEIFHSFVN
Sbjct: 209  RSQHIIHRDVKPDNILLDEQGHAHLTDFNIATIIKDGERATALAGTKPYMAPEIFHSFVN 268

Query: 643  GGTGYSFEVDWWSVGVMAYELLRGWRPYDIHSSNAVESLVQLFSTVSVQYVPTWSKEMVA 822
            GGTGYSFEVDWWSVGVMAYELLRGWRPYDIHSSNAVESLVQLFSTVSVQYVPTWSKEMVA
Sbjct: 269  GGTGYSFEVDWWSVGVMAYELLRGWRPYDIHSSNAVESLVQLFSTVSVQYVPTWSKEMVA 328

Query: 823  LLRKLLTVNPEHRLSSLQDVQAAPALAGVLWDHLSEKRVEPGFVPNKGRLHCDPTFELEE 1002
            LLRKLLTVNPEHR SSLQD+Q AP+LA VLWD LSEK+VEPGFVPNKGRLHCDPTFELEE
Sbjct: 329  LLRKLLTVNPEHRFSSLQDMQTAPSLAHVLWDDLSEKKVEPGFVPNKGRLHCDPTFELEE 388

Query: 1003 MILESRPLHKKKKRLAKNKSRDNSRDSSQSENDYLQDCLDAIQQDFVIFNREKLKRSQDL 1182
            MILESRPLHKKKKRLAKNKSRD+SRDSSQSENDYLQDCLDAIQQDFVIFNREKLKRSQ+L
Sbjct: 389  MILESRPLHKKKKRLAKNKSRDSSRDSSQSENDYLQDCLDAIQQDFVIFNREKLKRSQEL 448

Query: 1183 PREPLPAPESRDAAEPVED-EAERSALPMCGPICPSAGS 1296
              EP P PE+ D  +   D EAE +ALPMCG ICPS+GS
Sbjct: 449  MSEPPPGPETSDMTDSTADSEAEPTALPMCGSICPSSGS 487 (SEQ ID NO: 4)

>CRA|120000042903164 /altid=gi|13358640 /def=dbj|BAB33045.1|
(AB056389) hypothetical protein [Macaca fascicularis]
/org=Macaca fascicularis /taxon=9541 /dataset=nraa
/length=368
Length = 368

Score = 733 bits (1872), Expect = 0.0

Identities = 358/369 (97%), Positives = 361/369 (97%)
Frame = +1

Query: 193  MYAMKYMNKQQCIERDEVRNVFRELEILQEIEHVFLVNLWYSFQDEEDMFMVVDLLLGGD 372
            MYAMKYMNKQQCIERDEVRNVFREL ILQEIEHVFLVNLWYSFQDEEDMFMVVDLLLGGD
Sbjct: 1    MYAMKYMNKQQCIERDEVRNVFRELGILQEIEHVFLVNLWYSFQDEEDMFMVVDLLLGGD 60

Query: 373  LRYHLQQNVQFSEDTVRLYICEMALALDYLRGQHIIHRDVKPDNILLDERGHAHLTDFNI 552
            LRYHLQQNVQFSEDTVRLYICEMALALDYL GQHIIHRDVKPDNILLDERGHAHLTDFNI
Sbjct: 61   LRYHLQQNVQFSEDTVRLYICEMALALDYLCGQHIIHRDVKPDNILLDERGHAHLTDFNI 120

Query: 553  ATIIKDGERATALAGTKPYMAPEIFHSFVNGGTGYSFEVDWWSVGVMAYELLRGWRPYDI 732
            ATIIKDGERATALAGTKPYMAPEIFHSFVNGGTGYSFEVDWWS+GVMAYELLRGWRPYDI
Sbjct: 121  ATIIKDGERATALAGTKPYMAPEIFHSFVNGGTGYSFEVDWWSLGVMAYELLRGWRPYDI 180

Query: 733  HSSNAVESLVQLFSTVSVQYVPTWSKEMVALLRKLLTVNPEHRLSSLQDVQAAPALAGVL 912
            HSSNAVESLVQLFSTVSVQYVPTWS+EMVALLRKLLTVNPEHR SSLQDVQAAPALAGVL
Sbjct: 181  HSSNAVESLVQLFSTVSVQYVPTWSREMVALLRKLLTVNPEHRFSSLQDVQAAPALAGVL 240

Query: 913  WDHLSEKRVEPGFVPNKGRLHCDPTFELEEMILESRPLHKKKKRLAKNKSRDNSRDSSQS 1092
            W HLSEKRVEP FVPNKGRLHCDPTFELEEMILESRPLHKKKKRLAKNKSRDNSRDSSQS
Sbjct: 241  WGHLSEKRVEPDFVPNKGRLHCDPTFELEEMILESRPLHKKKKRLAKNKSRDNSRDSSQS 300

Query: 1093 ENDYLQDCLDAIQQDFVIFNREKLKRSQDLPREPLPAPESRDAAEPVEDEAERSALPMCG 1272
            ENDYLQDCLDAIQQDFVIFNREKLKRSQDLP EPLPAPE RDAAEPVEDE E+SALPMCG
Sbjct: 301  ENDYLQDCLDAIQQDFVIFNREKLKRSQDLPSEPLPAPEPRDAAEPVEDE-EQSALPMCG 359

Query: 1273 PICPSAGSG 1299
            PICPSAGSG
Sbjct: 360  PICPSAGSG 368 (SEQ ID NO: 5)
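
The identity and gap counts reported in the alignment headers are tallies over aligned columns. As a minimal, illustrative sketch (the function name `alignment_stats` is hypothetical, and only one 60-column block of the Macaca fascicularis alignment is used rather than the whole alignment), the per-block tally can be computed as follows:

```python
def alignment_stats(query, sbjct):
    """Count identical columns and gap columns in one pairwise alignment block."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have equal length")
    identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    gaps = sum(q == "-" or s == "-" for q, s in zip(query, sbjct))
    return identities, gaps, len(query)


# Query 1093-1272 and Sbjct 301-359 from the second alignment above.
query = "ENDYLQDCLDAIQQDFVIFNREKLKRSQDLPREPLPAPESRDAAEPVEDEAERSALPMCG"
sbjct = "ENDYLQDCLDAIQQDFVIFNREKLKRSQDLPSEPLPAPEPRDAAEPVEDE-EQSALPMCG"

identities, gaps, columns = alignment_stats(query, sbjct)
print(identities, gaps, columns)
```

Summing such tallies over every block of an alignment gives the numerator and denominator of the "Identities" line; this block alone contributes 56 identical columns and one gap out of 60.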
Hmmer search results (Pfam):

Scores for sequence family classification (score includes all domains):
Model    Description                                   Score   E-value  N
PF00069  Eukaryotic protein kinase domain              197.7   1.8e-55
CE00359  E00359 bone_morphogenetic_protein_receptor      9.8     0.044
CE00022  CE00022 MAGUK_subfamily_d                       9.2     0.013
CE00031  CE00031 VEGFR                                   1.6       1.3
CE00292  CE00292 PTK_membrane_span                     -92.7    0.0016
CE00291  CE00291 PTK_fgf_receptor                     -116.1     0.041
CE00287  CE00287 PTK_Eph_orphan_receptor              -125.0       2.8
CE00286  E00286 PTK_EGF_receptor                      -131.8    0.0056
CE00290  CE00290 PTK_Trk_family                       -206.7      0.58
CE00016  CE00016 GSK_glycogen_synthase_kinase         -248.2    0.0061

Parsed for domains:
Model    Domain  seq-f  seq-t     hmm-f  hmm-t      score  E-value
CE00022  1/1        86    107 ..    133    154 .       9.2    0.013
CE00359  1/1        95    144 ..    272    326 .       9.8    0.044
CE00031  1/1        79    171 ..   1051   1141 .       1.6      1.3
CE00286  1/1         3    212 ..      1    263 [    -131.8   0.0056
CE00287  1/1         7    212 ..      1    260 [    -125.0      2.8
CE00290  1/1         4    215 ..      1    282 [    -206.7     0.58
PF00069  1/1         2    228        26    271 .     197.7  1.8e-55
CE00292  1/1         5    233 ..      1    288 [     -92.7   0.0016
CE00291  1/1         1    233 [.      1    285 [    -116.1    0.041
CE00016  1/1         1    303 [.      1    433 [    -248.2   0.0061
1 GATGGAGCCC AGGCCACATG CTTGGGAAAG CGGGTGGTCC TGGGTGAGCC
51 TGTCTGGCAG GAGGGTGAGG TCCTGTCTGG AGTAGACACC CGGTCTGTGT 
101 CCGCCGCACA TGTGACAGGG CCCAGAGGGC ACGGGAAGCC CAGGTGCCCT 
151 GTCCCCTGGT GAGTGGGCTG TGGGGGTTGA GCACCCCAAG GGAAGGCGTG 
201 TTCTTCCAGA GATTCCCCCG TAAGAGCTGA GCTGCATCGT GAGCAGGAGG 
251 GGCAGAGGGA GCCTGGAGAG GGTTGGGTCG CCTGGGCAGG CAGCTTTCTG 
301 GGGTGGTGCA CAAGCGACAG AAAGCCTGAG GCTTGGGACC GGCTGAAGAC 
351 GGCCTTGGGC TGCGGGTGCT GGCGCCGCTC CAGGCAGAGG TGGGCTCTGC 
401 AGCCCCCCCC CCCCCGCCAG CTTCCCCCCC AGCAGCAAGA GCCCTGGCCC 
451 CAGCCAGTGG CCCCAAGCCA GGCTCCTGAG GGGCAGAGGG TGAGGGCCGG 
501 ATTTTCCACC ATATTTGTCT CACAGCCTGT CTGGTCCCAG CCCCAGGGCA 
551 ACAAACAGCC TTTCTGGAGC AGTTTCCAGA CCTGCAGTGG CCGCCTTGAG 
601 CCTGCAGTGA CCGTCTGCAG GAGGCCGCGG GTGCTGGGGC TGGCGCAGGA 
651 AAGCACCGTT GCTTCTGCGC CTGTGCAGAG TGAGGCTGGG GCTTCCATCC 
701 CGGGCACGGG ACTCCTCGGC CTCCTGCGGC CGTGTGCATG GGAGGAAGGC 
751 CGTGCTGCCG AGCCACGCGA CTCTGCCCCG TTGGCAGTGG GAAGCGGCAG 
801 GAGGGGGTCC TGCCAGGGGC AGCCAGGGGC TGCTGCAGCT TACGCTCACT 
851 GTCATTCTGA AACCCTCAAC TGGCTTTCAA AATAACAATT TAAAAAATGG 
901 TCATAGGAAA TGCAGGAAGT TCAGGAGAAA TCCCGCCCCG CCCCGCCCTC 
951 CCAAGGCGGC CTCGTTCAAG CTTAGTCTCC GTCTGTTCTC GGGCTGCCTG 
1001 CAGCTGCCCC CGCTCCTCAG CAGGTGTGGC CGCGTGTTCA GGAGCCCCCA 
1051 TCAGCACACG CGCCTCTGGG CAGCCCCCAA CACAAGGCTC TTCCTGTCCC 
1101 TTCAGGCTCA GCTTTCCCCT CCCACCGGGC AGGGAGGTGC TGAGGCCACG 
1151 CCTGTTGTCA GCTTCCTGGA GAGGCCATAC TTAGAGCCCA TGGTGCCAGG 
1201 CCAAGCCACT GTCTCCCGCA CCACAGTGCT GCAGCCGGAT CCACCCAGGC 
1251 CACCGCTTGG CACCATCAGA CATGCTTTCT TAGTTTGGGC CAGCCCGGTG 
1301 CTCTGTGCAC AGTGTGACTC CCAGTGGCCC CTTGCGGGAG GGAGGGTCAC 
1351 CGCTTTGCTT CACAAAAGGA CTGAGTCTCA GGGAGGGGTC TCCAGTAAGG 
1401 GCCCCGGAGC CAGGATGTGA CTGAGACAGG TGCCTCCAGG GCCACACACT 
1451 AACCAGTACA GGAACCTCTG GGGGCAGAAT CATGGCCTCC AAAACACCCA 
1501 CATCCAAATC CCAGAACAAA CATGTTACCA TTCGTGGCAG AGAGGAATGA 
1551 AGATTGCAGA TTGAATTAAG GTTGTTAATC ATCTGACTTA ATTTTTTTAG 
1601 AGACAGGGTC TCGCCCTGTC ACCCAGGCTG AGTTCAGTGG TACAGTCGTG 
1651 GCTCACTGCA GCCTCAACCT CTCAGGCTCA AGCAATCCTC CCTTCTCAGT 
1701 CTCCTGAGTA GCTGGGACTA CGGGTGTGTG CCACCACGCT TTCCGGGCAT 
1751 GCAGCCAGGA GCCCAGGGCC ATCTGTGGCC CACCTTGAGA TCCAGAATCA 
1801 TCCATTTCCT CCAGGCCCCC TGCTGGGCTC CAACTCCTTG AGGACCAGAG 
1851 AGCAGAGGTT GTGGAAGGCC TTGGAAACGG GTCTGGATTA CCTGTCCTGG 
1901 GAAGGTCTCT CCCAACCTGA GTGTCAGACA GGGGTTAGCT CTGCTGCTCA 
1951 CAATTTTGTG CCTTAATTCC TGGCTTCCCT TTGGGGATCT TCATCCTCAA 
2001 TTCTGATTGA CATCCTTGGC CACAAGGGAC CCCCCTGCTC ATTGATGCTT 
2051 CTCACCCGTC ACCTCACTCT CATCCTCACT GCTAAGCAAT TAGCCGTGTG 
2101 TTTGCGGCAT CAGTGTTGAC ACCGATGATC CATGCTCAGA GGGTACAGGC 
2151 CTGAAGAGCT ATGTGGGGAC TGGCGCCCCG GAGGGGGTCC CGCTGTGGTG 
2201 GCAGCGGTGG CCCCCAAGCC CCACGCTCAC TCTGTGTGTC TCCTTGCAGG 
2251 ACAGGGTGAG GGGCCTTGGC CTCACGGTGT TGAGACGGGA GTCGGTCTCT 
2301 GGAGTGTGGA GTGATGTGCG TCCAGGGTGA AGCTGCAGCC ATGGTTGATG 
2351 AGGCTTCCTG AGCGCAGGGC TCTGTGCTCA GTGGGTTTTC GCATTCATTC
2401 CCCAGTACCC CTCCGGGCTG CTTGCTGCTC CAAGCCCTGG AAAAGGATGT 
2451 TGGGGTTTAG GAAGGCAAAA CTCCATGCCC AGGTCGTGGC TGGTGAGGGG 
2501 CGCTCCCCGC ACAACCAGCC ACTGCTTGGC TCCAACTCAC GCCTGGATGC 
2551 TGTTAGGCTG GACCCTGTCT GTTTGCAGAT AGCGCCTGTT GACAGATGTG 
2601 TCCTGCTGCA GACTTGAAAC GCAGGACTGA GTCTCAGGGA GGGGTCTTCA 
2651 CTAACAGCCA TGGAGCCAGG ATGTCACTGA GACAGGTGCC GCCAGGGCCA 
2701 CACACTAACC AGTACAGGAA CCTCTGGGGG GCAGAATCAC GGCCTCCAAA 
2751 ACACCCACAT CTTAATCCCC AGAGCAAATA TGTTACCATA TGTGGCAGAG 
2801 AGGAATGCAC CTGCCGCGTC TAGGAGGCAG AGGGGCTGCG GCGCGTCCCC 
2851 AGGTGTCCCC TTGTGTCCTG ACGATGCGTC CCCAGGCATC CCCATGCATT 
2901 TCCAGGTGTC CCCACGTGTC CCCAGGCGTC CTCAGGCGTC CTGCCGAACA 
2951 GCCCTGTGCC CTCCAGGTGT GCATTGTGCA GAAGCGGGAC ACGGAGAAGA 
3001 TGTACGCCAT GAAGTACATG AACAAGCAGC AGTGCATCGA GCGCGACGAG 
3051 GTCCGCAACG TCTTCCGGGA GCTGGAGATC CTGCAGGAGA TCGAGCACGT 
3101 CTTCCTGGTG AACCTCTGGT GAGCCTGCCA TGGCCTCGCT GCAGAAAGAC 
3151 TTACCGTCCT GAAGCCGGGA AGGCACTGGC TATCCTCTCC TGCCCTTGGT 
3201 GCTCCTTGGC CCTCGTGCCT CGGAGATGCC TCTGCCACCC ACAGGCCCTC 
3251 TCTAGCCCTC CTCATCTATC CCCCTTCCTG TGCCCCAGGC CTGGCAGTGG 
3301 CCCAGGTGGC CATGACATGC TGGGGTTGGT TAATGCAGTG TCTCTTCTGA 
3351 GCCTGCCGGA AGACCAGGGC TTCCCTACAA TGGAGATGTG CTCCCATGGA 
3401 GTCTCTGGCA CTAGTCAGAG AGGGAGAGAG TTTAGGGACT GAAAAACTCA 
3451 CCACTGTCAT CACCATCACC ATCACCACCA TCATCGCCAT CACCACCACC 
3501 ATCACCACTG CAACCATCAT CACTCTCATC ACCATCATCA CTATAATCAA 
3551 CACAATCACT ATTGTCACCA CCATTACCAC CACCACCACG ACCACAATCA 
3601 CTGTTATCAC CATCACCACC ACCCTCAGTC CTCACTACCG TCATCCTCAC 
3651 CATCACCGTC ACCACCACCA CCATCACTGC CATCGTCAAC ACCATGGATG 
3701 CTGGTTGTTA AATGCCAGCT CTTTGCCCAA CACTGTCAAG AGTGGTACCT
3751 ACACGGCCTC ATTTTCTGTA ACAACCCTCC GAGGCAAATG TCTGTATCCC 
3801 CATTTTACCG AAGAGGAGGC CGGGCAGCCT GAAGCACCCG GAGCTGGCAC 
3851 TGTAGCTCTG CTCTGCATTT GCCACTCCCA GGTGCCTCTG GCCCCAGCTG 
3901 GGCCACCTCC AGCACAGGGT GGTGTGTCTT TCCTCAGGAT CTGGGCTCAG 
3951 AGCTGCTCTG GGCTGGGGTG CAATCAGTGC CTTGGGCAGG CCCCTCCTCC 
4001 TGGGAATGCC TGGTGGCTGA TGCTGGGGTG GGGCTGTGGT CCTTAGGGGG 
4051 AGTGTGTCAG CTGTGGGAGC AGCCATGACT GGCTCCCCAG CTGTGCGCAC 
4101 AACAGGCCTT CCATCGGTGC CCACAGGTAC TCCTTCCAGG ACGAGGAGGA 
4151 CATGTTCATG GTCGTGGACC TGCTACTGGG CGGGGACCTG CGCTACCACC 
4201 TGCAGCAGAA CGTGCAGTTC TCCGAGGACA CGGTGAGGCT GTACATCTGC 
4251 GAGATGGCAC TGGCTCTGGA CTACCTGCGC GGCCAGCACA TCATCCACAG 
4301 GTGTGTGCGT GGCAGACGGC GCAGGTACCT GCTGAGGTGG GCGGGGCTGA 
4351 AGCAGCCTTA GGTCAGGCTG CCGGCACGGC GGCCGTACTC CCTCAGAGCG 
4401 GGTCTAGCTC CTCTGCCCCA CCCTTGCCTG AGTGCCTGCC CCCAGCTGTG 
4451 GCACCTGTGC CGACCAGGTC AGCCCCATAG CTGTGTGCCT GGTGTCCATC 
4501 TGGGGGGACC TCGTCCCAGC AGCCCCAGCT GAGACTGGGC ACAGTGGGCT 
4551 GTTAGCCCTG GTGGACAGAC CACCAGGCTG GGTCACAGCA GGTGGCCTTC 
4601 ACCTGGTCCA TTTAACTGAA GACTCCTGTT TGCCCATCCA CCACATCCCA 
4651 GGGAATCCAA ACTAATTTTA ACATTAGCTT AAAGCAGATG AAATTAGGAA 
4701 GCAGAGCTGG TGTGATGGCT CTGAAAATAA AATTTAAAAA AAGAAAATAG 
4751 GAAGCAGATT ATGAAGGAAG TGAAATTGGG AAGCAGAAAT TAGGCTGAAA 
4801 TTCCGCAGCA ATGGAACAAA ATGAAAATAT CTGTGAGGTA TATTTTAAAG 
4851 TCGAATGGAC TGGTGTTTGC ATTTCTGCTC TTGGGGACTC GGATGTCTGA 
4901 TTATGACCTA GGCACCAGTC ACTGAGCACT GGCTGTGTAC CTGGAAAAGT 
4951 TGGGACAAAG CAAGAGCCGA GGTGGCTTGG TCTCCTAGAG GCCGAGTCTT 
5001 GGAGGGGGAG GGCAGACCCT GCCAGCAATT GCTCTCGTCC TCTGGGGCTC 
5051 CAGGCCCCCT CCCAGCATCT GGTGCCAGGT GTGTGCTGCT GCCCAGATGC 
5101 CACAGGGAAC GAAGTGGCTG ACTTCATCGC CTCTGCCCCC ACGCAGGGGT 
5151 GTGAGGTCCT AGCATCATCC AAGGACCAAG TCAAGCTCCC AGGCCTCTGC 
5201 CTCGAGTGGG TTGGTGGGAT GTCCTGGGGA CTCCAGGGAT TGTGACAGAG 
5251 ATTCCAGGGC AGAAACAGGG CAGATTCCCA ACTCACCTTC CCACTTTCTG 
5301 CTCTTTCTAG AGATGTCAAG CCTGACAACA TTCTCCTGGA TGAGAGAGGT 
5351 GTGTGGGGTT GGGTGTGGGC AGCCCAGGTG GGTGGTGGCA GGGATGGGCC 
5401 TGTCAGGGGA GGAGGATCCT GCACGCAAGG ATGCATCTCT GGTCCTGGGA 
5451 CAGCCACACC TGACCCCTCT CTGCACAGGA CATGCACACC TGACCGACTT 
5501 CAACATTGCC ACCATCATCA AGGACGGGGA GCGGGCGACG GCATTAGCAG 
5551 GCACCAAGCC GTACATGGGT GAGCCCGAGC TGGGGTTCCA GATGGGAGCT 
5601 GGCTTCCTCC AGGTGGGAAG GACAAGACCT CGGTGGCTTC TCTGTCCCAC 
5651 CCTGGAGGCA GCCTGGTCTC GGGATGTGGC CTCAAGGTGC CGGCCCTGTG 
5701 CCCACGGGTC CGGGCTGTGA CCCCGTGGCA GCTGTTTTTC CTTCTTTCTG
5751 TCGGAAAGCT CCGGAGATCT TCCACTCTTT TGTCAACGGC GGGACCGGCT 
5801 ACTCCTTCGA GGTGGACTGG TGGTCGGTGG GGGTGATGGC CTATGAGCTG 
5851 CTGCGAGGAT GGGTATGGAC CCCCTGCAGC CCCCGGGCTT GGCTGCCAGG 
5901 CCCCTGCTCT CTGCCCCCAC CAGTGCTGGG GAGGGGGTGG CTGCCCCAGT 
5951 GCCCAGGTGC GCAGGGATGT CTCCACTGTG TCTGAGGAGT CACGCTTTTA 
6001 TCGAAGTGTG TAGTTGGTGA TGGAATGCCT GAGCAGGAGG AGGAAGGACA 
6051 GACTCACTGT GGTTTCCCGG GGCCGCTGCT GGTGCCTGCA GGCCAGCCTC 
6101 TGTGGGGGTG GACAAGGCTG AGAACTGGCC AGCAGGGGTG CTGCCTCGGA 
6151 ACTTTCCACA AAAAGTTTCT TTTGGGGCCC TGTGCTCTTA CCCTTGTGGC 
6201 CACGGCGAGG CCAGTCCTGG AGACCGGGAG GCTGGGGGTC CTCTTGTGGA 
6251 CCGTACCCCT CAGCCCTGCA CAGGACCCCA CCTCTGAGGA AGCCAGCTCC 
6301 CTCCTGGCCC TCTGGGGCTG ATCTACCTGG ACCCAGGCCC CCTGGGATCC 
6351 CAGCCAGATG GGCGCAGCAG CCAGGGCGCA GGACCCAGGC GTAAGCTTTA 
6401 TCTCACCCAG GCTCCTCCGC GGCAGGTGGA GGCCAGGCTG TGCTCAGAGC 
6451 TGTGCCTGCA CTTGGGGTGG GGGGAGGGGG TCCTCTCAGG GCGATGGCAC 
6501 CTGTGTCTGG CATTGTTCTG GGTGTCCTGG GGGCCAGGAG GACCTGCCCA 
6551 GCACTGCCTC CCTGTCTCCA GAGGCCCTAT GACATCCACT CCAGCAACGC 
6601 CGTGGAGTCC CTGGTGCAGC TGTTCAGCAC CGTGAGCGTC CAGTATGTCC 
6651 CCACGTGGTC CAAGGAGATG GTGGCCTTGC TGCGGAAGGT GAGCCCCCAT 
6701 CCCTGAGCCT CCTCACCCTC CGAGCACCCA CCTCCCTCCC TCACTTACCT 
6751 GCGGCTCGGG ACACCCCCTC CAGTGCACAG TTAGTGCCGC TTCCTGGCAG 
6801 GCACAGATCC CTTCACTGCA ACCTGTGGGG GCCTCCGCAG ATGGCAGCCC 
6851 CAAGCCCCAG GAAGCGAGCT GGTGGCAGGC TCTGTGGCCC TCTCATGGCG 
6901 AGCCCTACCA GGTCACTGGT GTCCTGGTGA TCCCCTGAGC TGCGTCTCCA 
6951 GGCACCCTCA CAGCAGGCTT GTGCGCCCGC CCTGCCGGTC ACCACAGAGG 
7001 AGCCCTGAGC CACGTCCCCT TGTCCCACGT ATTGGAAGGA GGGTGTCGGG 
7051 TGGGTGGGTT CAAGCCCATG CTATCTCCGG GACCCTTTGC CCCAGTGCCT 
7101 CCTGGGGAAG GTGGGCAGCC ACTGCCCACT GCAAACACCT CTCAGGGGAC 
7151 ATCAGCCTGG CAGGGACACG GGCAGGGGTG TGGCCATCAG TAGTGCCTCC 
7201 CATTTGTGAT CTGGTGCTAG GCTGGGGCTG TGCCTTGACT GGGCTGCACA 
7251 GCTCCCTGTT CCGAGCCTCG CATTAACACC ATAGGGGGTT CGGAGTCAGA 
7301 GCCAGGCCCA GGCAGGACAG GGAGGGGAGT GAGTGTGCCA CACGGGCCCG 
7351 GCTGCCTCCC GGCCCCCGTG TCTCAGGCAG GTGGGGCCTC CTGCCCTGGA 
7401 ATTGTAGCCA AGCAGCCTAA AGCCTTGGGG AGGCCTTGCC TGCCGGGGCC
7451 TCTCCCCAGC CCCGAGAGTC TCTTAACTCT GCTGTAGCCC CATGAAGCTC
7501 AGTCACACCT GCCCAGGTGG CTCACAAGGT GGCACTGGGC TAGAGAGGGC
7551 CTGCGTGGGG ACTGGGGATG ACCCACACGC CCAAGCCCAG GTCTGGGAAA
7601 CCTCGCACGG GGTCTGGGTC TGCGGCATTT TCCCTGGAAA GGCGGGAGGT
7651 GCCAGCGCTG GGATGTTGCT TCCCAGGCCA TGCATGGCTG CCCCGGGCTC
7701 ATCTGGCCTG TGGAGGTCCC ATGATTCGGT GAAGGAAGTG GCTCTGGGAT
7751 AGTTACTGTG AGGCCAGCCA TGTGCCGAGT GTTAGCCGCT AGCCGGGCCT
7801 CGGCTGCCAC CTCCTGGCAA ATCCCAGCAG AGCCTTCCCT GCAGATCCCT
7851 CTGCTGTCCT CTGGCGCCAG GGGTTTAGGT AGCAGCACTG AGAACAGGCG
7901 TCCCTTGGGC CACATGCTGA GCCAGCCACG GTGCTTTGCC TGATGTCGGC
7951 CGTCGGCACC ACCCTTCCTC GCGTGGCCCT GAGGTTCCTG AATTCTGAAC
8001 CTGAGGCTTG GTGGGACCCT CCTCAAGGTG CCCTGGCCTG GGGGTGGCGG
8051 GCTATTCCGT GCTGGTGGGC TGTGGGCCCT GGACCCTCTG ACTCATGCCT
8101 GGTTGCAGCT CCTCACTGTG AACCCCGAGC ACCGGCTCTC CAGCCTCCAG
8151 GACGTGCAGG CAGCCCCGGC GCTGGCCGGC GTGCTGTGGG ACCACCTGAG
8201 CGAGAAGAGG GTGGAGCCGG GCTTCGTGCC CAACGTAAGC CTGTGGGCGG
8251 CTCAGGTGGG GGGCCCTGGG GATGGATGTG GCGTCCTCCA CGGGCCGGGG
8301 CTCAGCACCC ATCCCTCTGT AGAAAGGCCG TCTGCACTGC GACCCCACCT
8351 TTGAGCTGGA GGAGATGATC CTGGAGTCCA GGCCCCTGCA CAAGAAGAAG
8401 AAGCGCCTGG CCAAGAACAA GTCCCGGGAC AACAGCAGGG ACAGCTCCCA
8451 GTCCGTGAGT GCCAGGGCAG GCTCAGGGCG CGGCGGCGGG CTGGGCTTGG
8501 GGCTCCTCTC TACCACCGAG CAAGGTGTGT GGGGACCCCT GACAGTGCAC
8551 ACGTCTCGGA AGTCCAGCAG ACCGTTTCCT GAAGTCCTGA GAAGGCCAGA
8601 GACCTCCCTT CTGCCTTTCC CAGCCCCCAC CTCGCTCCTT ATGAAGCAGG
8651 TGGGCAGGGA CAACCAGGGC TGGGGTTATG AGTGCACGGG GATGGCCATG
8701 TGAAGCCTTC GTGCTTGCCC AGGTGTGCTG GTGTTGGTTG TGTGTGCGGG
8751 GACGGCTATG TGAAGCCCTC ACACTCGCCC AGGTGCGTCG GCATCAGGTA
8801 TGTGTGCCGG GACAGCCATG TGAAGCCCTC ACACTCACCC AGGTGCGTCG
8851 GCATCAGTTG TGTGTGTGGG GACGGCCATG TGAAGCCCTC ACACTCGCCC
8901 AGGTGTGCTG GCTTTGGTTG TGTGTGCAGG GATGGCCACA TGAAGCCCTC
8951 ACACTCGCCC AGGTGCGTCA GCATCAGGTG TGTGTGCGGG GACGGCCATG
9001 TGAAGCCCTC ACACTCGCCC AGGTGCGTTG ATGTTGTGTG TGCAGGGATG
9051 GCCATGTGAA GCCCTCACAC TCACCCAGGT GCGTTGATGT CAGTTGTGTG
9101 TGCAGGGACA GCCATGTGAA GCCCTCAGAC TAGCCCAGGT GTGTCGGTGT
9151 CAGTTGTGTG TGTGGGGATG GCCACGTGAA GCCCTCACAC TTGCCCAGGT
9201 GCGTTGATAT TAGTTGTGTG TGCAGGGATG GCCACGTGAA GCCCTCACAC
9251 TCACCCAGGT GCGTTGATGT CAGTTGTGAG TGTGCGCAGG GATGGCCACA
9301 TGAAGCCCTC AGACTCGCCC AGGTGTGCTG GCTTTGGTTG TGTGTGCAGG
9351 GACGGCCATG TGAAGCCCTC ACACTCGCCC AGGTGCGTCA GCATCAGTTG
9401 TGTGTGTGGG GACGGCCATG TGAAGCCCTC ACACTCACCC AGGTGTGTCG
9451 ACATCAGTTG TGTGTGGGGG GACGGCCATG TGAAGCCCTC ACACTCACCC
9501 AGGTGTGTCG GTGTCAGTTG TGTGTGCGGG GATGGCCACG TGAAGCCCTC
9551 ACACTTGCCC AGGTGCGTTG ATATTAGTTG TGTGTGCAGG GATGGCCACG
9601 TGAAGCCCTC ACACTCACCC AGGTGCGTTG ATGTCAGTTG TGAGTGTGTG
9651 CAGGGATGGC CACGTGAAGC CCTCAGACTA GCCCAGGTGT GCTGGCTTTG
9701 GTTGTGTGTG CAGGGACGGC CATGTGAAGC CCTCACACTC GCCCAGGTGC
9751 GTCAGCATCA GTTGTGTGTG TGGGGACGGC CATGTGAAGC CCTCACACTC
9801 GCCCAGGTGC GTCAGCATCA GTTGTGTGTG TGGGGATGGC CACGTGAAGC
9851 CCTCAGACTA GCCCAGGTGC GTCGGCATCA GGTGTGTGTG CCGGGACAGC
9901 CACGTGAAGC CCTCACACTC GCCCAGGTGT GCCGGCTTTG GTTGTGTGTG
9951 CGGGGACGGC CACGTGAAGC CCTCATGCTC ACTCAGGCAT GCTGGTATTC
10001 TGGGGCTGCC AGGACAGGTG ACCACGAATC AGGTGGTTGA AGAACAGCAA
10051 TGCGTCTCTC TGAGAGGATC TGAGTCGTAA TGAAATGGTC TCCTTCACAG
10101 CCGGCTGTGC GTGAACTACT CTGTCTCCTG CAGCTCCCCT GTCTTGATAA
10151 TTGGCTGTCT AGGCAGCGGG TAAGGTGAAC CCCTTGGGCA GTTATGTGAT
10201 GATCTCAGTT TCTGTAAACC GGAAGTCCAG GCATGGTGCA GCTCTGTTCC
10251 CTGCTTCGGG GTCTCACCAG AATGTGAGCT AACATTGAGG TCGTGGCCTT
10301 GTCAGGTGCA GCTCTGTTTC CTGCTTCTGG GTCTCCCCAG AGTGTGAGCT
10351 AATATTGTCT GAGGTCGTGG TCTCATCAGG GATTTGACAG GTGCTGTGGT
10401 TGAAATGTTT CCCTTAAAAC TCGTGTTGGA ATTTGCTTCC TATTGTGATG
10451 GTGGTAGAGA TGGGATTTTT GGGGGCTGAT GGGGCCACGT AGGTTCTTCC
10501 AGCATGGATG GGGTTAATGC TGTTGTAGAA GGGTGACTTT AGTCCTCTTT
10551 TGAGTCTTTG ATCCTCTGCT ATGTGAGGAC GTGGTGTTCC CAATGTGGAC
10601 GTGGTTCGTG TTCCATGTGA ATGTGATATT CACAATAGAG CATCAACAGG
10651 CTCCCTTTTA ATCAGCAGAT TTAAAAAGAA ATGTGTTGTC TCATGGCTTG
10701 GAGGCCTGAG TCCAAAGTTA AGATGTCAGC AAAGCCGTGC CCCCTCTGAA
10751 GGCTCTCCGG GGAGGAAAAC CAGTCCTTGC CCCTCTACCC TCCGGTAGAG
10801 GCTGCCTTGG CCTAGACGCA TCCCCCCAGC CCCTGCTTCG CTGCCGCGTG
10851 GGGTCGGCCT GTGTGTGCGT CTCCATCTCC TCCCCTCTTC TCATAAGGAC
10901 ACCAGGCATT GGATTTAGGG CCCACCCTGA TCCAGTATGG CCCCATCTTA
10951 TCTTGATGAT ATCTGCAAAG ACCTCACTTC CAAATGAGGT CACATTCACA
11001 GGTACCCAGG ATTAGAATTT GAGTGTGTCA TTTTTGGGGA CACAGTTTGG
11051 CCCATACCAC CAGGATGTGG CTGATATTCA CCAAGGAGTA GCTATGGTTG
11101 TGTGTTGATG TCAGGGTGAC GGTGATGACC CTGGGTCCCT CGGTGGTCCC
11151 CTTGCCCCTG AGTCTGCCTG AGCCTGTGGT GGATGTCCTG GGAAACTCTT 
11201 GTGCCTCAGC CCCCGTGCAG CCTCCTCAGA CCTGGTGGGC CCTGTGTTGC 
11251 TCCTGGGCAG AAGACGGGTG TCAGTCCCCT CCTCACCATG ATGTGGGGGG 
11301 CAGGGGTGGG GTCATGCCCT GGGTGCCCTG ATTTTGGGGG GAACACGGCC 
11351 CCCCAGTGGG TCAGGCTCCC ATCCTCGCCC CTCCTCCAGG ACGGCTGCCG 
11401 GCAGCCCTGG GTGTCCTCAG GCAGAATTGC TGGTGGAGAG CTGCTGTCTG 
11451 CCAGGTGGCC ACTGTGAGGC ACTGCTGAGA GCCACAGGAT GGTTGGAAGG 
11501 TTCTCGGGGT TGGGGGTTCT TTGGCATTGC CCCCATTGGA TGTTTAAGTT 
11551 TTCCCTACCA GAGCATGTCC AGAGCCAGGG CTCTGGGGTG TAGAAACAGG 
11601 CCCAGGATGA GTTAGGAACC CTCATGGGAG ACTCAGGGAT GGACAGTGTG 
11651 CAGAGCCCAG CTGGCCATGC TGAGTTCCCA GGAGGCTCTG GCTGGGAACA 
11701 GGTAAGGCCA GGCACCTGTG AGCGGGAGGA GCTCGGCTTT GTCTTGGGTT 
11751 GGTTGTGTGG AGATGTTTTG GCTTGAGGGT AGGAGGTGTT CTGAAAGGAA 
11801 AGCATCACTC CAAAAAAAAA GTCCCACTGT TAACCTTGAG GCTGAGAGAG 
11851 GTTTTTGGAA ACAGCTTTAT TTTGATATAA TTCACATTCC ATGCAATATA
11901 CAGTGCATCC ATGTAAAGCA TATAATTCCA TGGTTTTTAA TATAGTCACA
11951 GGGCTGTGCA TTCTCCACCA CAATCTGATT TTAGAACCTT TTCATGTAAT 
12001 GTAAGAGAAA GACCCCACCT ATTAGCAGTC ATGCCCCATT CCCCTCTTCT 
12051 CCCCTCCCCT GGCAGCCACG AAGCTACTTT CCGTCTCTGT AGGGTTGCCT 
12101 GTTGTGGGCG TTTCATGGAA GTGGAGTTAC ACACTATGTG GTCTTTGCAG 
12151 CTGGCTTCTT TCACTTCGCA GGATGCTTTT GAGGCCCGTC CACGTTGTAG
12201 CCTGTCAGTG CTTCATTCCT GTTGATGGCT GAGTAATATT CCACATATGT 
12251 ATCACCTTTC CTTTATCCAG TCATCAGTTG ATGAGTATTT GTTCTTTCCA 
12301 CTTGTTAACA TTTTTCATTA TCATGAATAA TGCTGCTGTG AACATTCACA
12351 TACAAGTCTT TGTGTGGATA TGTACTTTTA TTTTTGGGGG GCACATACCT 
12401 AGGCATGAAC CCGCTGGGTC ATATGTGACT CTGTGCTTCA TGTTTGGAGG 
12451 AAACACCTAC CCTTTTCTAC AGCAGGTGTG CCATTTTATG CCCCTACCAG 
12501 CAGTGTGTGA GGGTTCTAAT TTCTCCATAT ATTTACCAAG TCCTGTTATT 
12551 GTCTGGTTTA TTTTTTTAAA AATCATAGTT ATCTTAGTGT GCAGTGATTG
12601 TGTGGTTATG GTTTGCATTT CTCTGATGAT ATTGAACATC TTTTGAGGTG 
12651 TTTTATCAGG CATTGTGTCT AGAGAAATGT CTATCCAAAT GTTTTAAAAT 
12701 TTTTATTGTC TTTTTTTTAG TCTACTCTGA CAATATATTT TAATTGGCAT
12751 ATTTATTTTA CTTTATTTTT TTTTAGAGAC AGGGTCTTGC TTTATTACCA
12801 AAACTGGAGT GCAGTGGTGC AATGAAGGCT CACTTCAGCC TTGACTTCCT 
12851 GGGCTCAAGT GATCCTCCCT GCCCCAGCTG CCAGAATGGC TGGAACTGTG 
12901 GGTGTGCACC ACCACACCTG GCTCATTTGA AAAAAATTTT GTTGTAGAGA 
12951 CAGGGTCTCA CTATGTTGTC CAGGTTGGTC TCAAACTCTT GGAGTCCTCC 
13001 CACCTCAGCC TCCCAAAATG CTGGGATTAC AGATGTGAGC CACTGTGCCT 
13051 GACCTAATTG GTGTATTTTA GACCATTCAC ATTTAAAGCG ACCAGGGAGG 
13101 CTGAGGCAAG AGGACTGCTT GAGTCCAGGA GTTTGAGACC AGCCTGGGCA 
13151 ACAAGGTGAG ACCCATCTGT ATTAGTCTAT TTTCACACTG CTAATAAAGG 
13201 CATACCTGAG TCTGGGTAAT TTATACAGGA AAAAGGTTTA ATGGATTTAC 
13251 AGTTCCACAT GGCTGGGGAG GCCTCACAGT CGTGGAAGGC AAGGAGGAGA 
13301 AAGTCACATC TTACATGGAT GGCGGCAGGC AAAGAGAGAG CTTGTTCAGG 
13351 GAAACTTTTG TTTTTAAAAC CATCGGATCT CATGAGACTC ATTCACTATC
13401 ATGACAACAG CACAGGAAAG ACCCGCCCCC ATAATTCAAT CACCTCCCAC 
13451 CAGGTTCCTC CCACAACATG TGGGAATTGT GGGAGTCACA ATTCAAGCTG 
13501 AGATTGGGAT GGGGACAGAG CCAAACCATA TCATTCTGCC CCAGCCCCTC 
13551 CAAAATCTCA TGTCCTCACA TTTCAGAACC AATCATGCCT TCCCAACAGT
13601 CCCCCATAGT CTTATTTTGG CATTAACTCA AAAGTCCACA GTCCAATGTC 
13651 TCATCTGAGA CAAGGTAAGT CCCTTCTGCC TATGAGTCTG TAAAATCAAA 
13701 AGCAAGTGAC TTCCTAGATA CAATAGGGGT ACAGGCATTG GGTAAATTCA 
13751 GCCATTCCAA ATGGGAGAAA TTGGCCAAAA CAAAGGGGCT ACAGGCCCCA 
13801 TGCAAGTCTG AAATCCAGCA GGCCTGTCAA ATCTTAAAGC TTCAAAATGA 
13851 ACATCTTTGA CTCTATCTCT CACATCCAGG TCATGCTGAT GCAAGAGTTG 
13901 GGTTCCCATG GTCTTAGGCA GCTCTGCCCT TGTGGCTCTG CAGAGTACAG 
13951 CCTTCCTCCC GGCTGCTTTC GTGGGCTGGC ATTGAGTGTC TGTGGCTTTT 
14001 CCAGGTGCAT GGTGCAAGCT GTTGGTGGAT ATACCATTCT GGGGTGTAGA 
14051 GGATGGTGGC CCTCTTCTCA GAGCTCCACT AGGCAGTGCC CCAGTGGGGA 
14101 CTTTGTATAG GGGCACCAAC CCCACATTTC CCTTCTGCAT TGCCCTAGCA 
14151 GAGGTTCTCC ATGAGGGCCC CACCCCTGCA ACAAACTTCT GCCTGGACAT 
14201 CCAGGTGTTT CCATACATCC TCTGAAATGC AGGCAGAGGC TCCCAAACCT 
14251 CAATTCTTGA CTTCTGTGCA CCTGCGGGCT CAACACCACA TGGAAGCTGC 
14301 CACAGCTTGG GGCTTGTACC CTCTGAAGCC ACAGCCTGAG CTGTACCTTG 
14351 GCCCCTTTCA GTCATGGCTG GAGCAGCTGG GATGCAGTGC AGCAAGTCCC 
14401 TAGACTGCAC ACAGCAGAGG GACCCTGGAC CTCGCCCATG AAACCATTTT 
14451 TTCCTCCTAG GCCTCTGAGT CTGTGATGGT AGGGGCTGCC GCAAAGGTCT 
14501 GTGGCATGCC CTGGAGACAT TTTCCCCATT GTCTTGGTGA TTAATATTCA 
14551 GTTCCTTGTT GCTTATGCAA ATTTCTCCTC AGAAAATGGG GTTTTCTTTT 
14601 TTTTCTCTCT TTTTTTTTTT TTTTTGAGAC AGTCTTGCTC TGTCACCCAG
14651 GCTGGAGTGC AGTGGTGCAA TGGCGGCTCA TTGCCACTGC AACCTCCGCT 
14701 TCCTAAGTTC AAGTGATTCT CCTGTCTCAG CCTCCCAAGT AGCTGGGATT 
14751 ACAGGCACGC ACCACCACAC CCAGCTAATT TTTGTATTTT TAGTAGAGAA 
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14801 GGGTTTCACC ATGTTGGCCA 
14851 TCTGCCTGCC TTAACCTCCC 
14901 GTGCCCAGCC AGGAGTTTCT 
14951 TTGAACTTTT ATGCTGTTTC 
15001 CCAAGTCACC TCTTGAATGC 
15051 ACCCTAAATC ATCTCTCAAA 
15101 GCAAAATGCT GCCAGTCTCT 
15151 CTGTAGTTCC CAACAAGTTC 
15201 GACTTCATTG TCCATATCAT 
15251 AGTCTCTAGG AAGTTCCAAA 
15301 CCCTCCAGAT GGTTCCAGCC 
15351 TTCCACATTT TCAGGTATCA 
15401 ATTTACTGTA TTAGTCTGTT 
15451 CTGGGAAATT TATACAGGAA 
15501 GCTGGGGAGG CCTCACAATC 
15551 TCTTACATGG ATGGCAGAGA 
15601 AACCATCAGA TCTCATGAGA 
15651 AAGACCCGCC CCCATAATTC 
15701 ACACGGGAAT TGTGGGAGTC 
15751 CAGCCAAACC TTATCACCAG 
15801 ATAACAGTGC ATGCCTGTAG 
15851 CAGGAGTTTG AGACCAGGCT 
15901 AAAAAAATTA TCTGGGTGTG 
15951 GGAGGCTGAG GCAGTAGGAT 
16001 AAGCTATGAT CATGCCCCTG 
16051 TGTCTTGTAA ATAAATAAGT 
16101 CCTTATTTGT AACTGTAGTC 
16151 TCTGCC I I I I CTGGTTTTAA 
16201 CCTCTCCTGG CCTGTTAATT 
16251 GGCCTGGACA TTGCAGTATA 
16301 ACTCTGCCCC CTCATGTGCA 
16351 CAATCCTCCT GTTCCTGATG 
16401 ATGCTATAAT TGCTCATTAC 
16451 LI I I IGGATC AATTAAGAAA 
16501 TCCTCCTCTA AAGTGCTTCC 
16551 AAATCATTTT CCTTTCCCCT 
16601 CAGGTCCAAC AGAGATGAAT 
16651 TTTACTTTCA CCTTTAAAGA 
16701 GGTAGGTTTT TTACTTTCAA 
16751 TGCTTATGTG ATTTCTAACA 
16801 TAGAATTCCT CGCCACCCCC 
16851 TCTTTGGTTT TCTGCAATTT 
16901 TTTTAATATT CATTCTGCTT 
16951 TTGGTGTCTG TCATTAATTT 
17001 TATTCTGTTC TCTTTCTCTT 
17051 GGTGCCGTTT GAAATTGTCC 
17101 TCACTCTTTT I I I I I ICTCT 
17151 CATTTCTTCA AGCTCATAGA 
17201 GAACCATCAA AGGCATCCTC 
17251 ATTTCCTTTT GATCCCTCCT 
17301 CTTCTGTTCC CCATCTGTTC 
17351 TATTAATCAC AGTTATTTCA 
17401 GCCATATCTG GATCTGTTCT 
17451 I I I I I CCTTA AGACCTTGTA 
17501 ACGTGATGTA TCTGGTAATA 
17551 GGTTTATGTA ATCTGGCCAG 
17601 CTGTGGGTAC CAGAGGCTTC 
17651 GGGCTTGCCT AAATCCTCCT 
17701 CTCTGCTGGA ATCCCTGTCA 
17751 AGATGGTGGG GAGGGAACTG 
17801 GGGGGCCTGT GCCCCTTCAC 
17851 GGTGAGACAG GCTAAAGCGG 
17901 GTGAGATAAG CCTTTCCTTT 
17951 GCTCTGGGTG TATTTCACAG 
18001 AGGAAGGGAT CATCCTTGGC 
18051 GGAAACCCAC AAAACGTTGG 
18101 CTGCACACGG CCCCGGCCGC 
18151 ACACCATCAG TTCTGCTCCA 
18201 ACCTGTCTCT CCAGACTTGG 
18251 CTGATGGGTC CAGGAAAAGT 
18301 TGTCGCAGGG ATGGGAGTGA 
18351 AGCCAGAAGT CCTTGCTCGT 
18401 TTATTTCCTG AACGCAGGTG 
18451 CCTCCTGTTC CATGGGCTAT 



GGCTGGTCTT GAACTCCTGA CCTCAGGTGA 
AAAGTGCTAG GATTACAGGC GTGAGCCACC 
TTTCTATTGC ATTGTCAGGT TGCAAATTTT 
CI I I I IAAAA TGGAATGCCT TTAACAGCAC 
TTTGCTGCTT AGAAATTTCT TCTGCCACAT 
TTCAGAGTTC CACAAATCTC TAGGGCAGGG 
TTGCTTAAAG CATAACAAGA GCCACCTTTG 
CTCATCTCCA TCTGAGACCA ACTCAGCCTG 
TATCAGCATT TTGGTCAAAG CCATTCAACA 
CTTTCCCACA TTTTCCTGTC TTCTTCTGAG 
TCTGCCTATT ACCCAGTTCT AAAAAGTTGC 
TTTCAGCAGC GCCCTACTTT ACTGGTACCA 
CTCACGCTGC TAATAAAGAC ATATCCGAGA 
AAAGGTTTAA TGGACTTACA GTTCCACATG 
ATGGCGGAAG GCAAGGAGGA GCAAGTCACA 
GAGCTTGTGC AGGGAAACTT TTGTTTTTAA 
CTCATTCACT ATCATGACAA CAGCACAGGA 
AATCACCTCC CACTGGGTTC CTCCCATGAC 
ACAATTCAAG CTGAGATTGG GGTGGGGAGA 
CTCTATAAAA GACAAAAAAA TTAGGCAGGC 
TTCCAGTGAT GTGAGAGGAT TGCTTGAGTC 
GGGCAACATG GCGAGACCCT GTCTCTACAA 
GTGGGATACA CCTGTGATAC CAGCTACGCA 
TGCTTGAGCC CAGGAGTTCA AGGCTGCAGT 
CACTCCAGTC TGGGTAACAG AGAGACACGC 
GGTCATTTAT ATAGTTCAAT ATGATATCTA 
TATTTATTGG TCTTCCTTTT TCCCTATTTT 
TTAAGCATTT TATATTATTC TAGTTTATCT 
ATACTTCTTT TTGAAATATT TTTAGTGGTT 
CCTTTACCAT ACAGTCTACC TTCACCTGAC 
GTGGGATGCC TTGTGACGGC ACCTCTCGTG 
ACATTGCTGT CATTCATTTC ATTTATCTGT 
ATTGTTACTA CTGTTATTTT AAACAGTTAT 
AATTAAAAAT TTCATTTTAC CTCTATTCAT 
TTTCTTTATG CAGACCCAAG TTGCTGACCT 
GAGGAACTTC GTTTAACATG TCTTATAGGA 
TCCTTCCCTT TTTGTTTGTC CAAAAAAGTC 
ATAATTTCAC TGGATATAGA ATTCTAGATT 
CACTTTAAAT ATTTCACTCC GCTCTCTTCT 
GGCAGTCTGC TCTAATTCTT TTTCTGTAAG 
CACCCCCAGC TCATTTCAAG ATTTTCTCTG 
GAATAATGAT ATGCCTAGGT ACAGATTTTT
GGTGTTCTCT GAGCTTCTTA GATCTGTGGT 
CAGAAAATTC CCAGTCATTA TTATTTCAAA 
CTCCTGCTGG AAATCCAATT ATGTGTGTGT 
CACAGCTCTT GGATATTCTG TTCATTTTTT
TTGCATTTCA GTTCGAGAAG TTTCTGTTGA 
TCCTTCCTCA TCTGGATCCA GTCTACTGAG 
ATTTCTGTTA CAGTGTTTTC TACTTCCAGC 
TGGAGTTTCC ATCTCTCTAC TTACATTGCC 
TTGCCTGCTT TTCCCCTGGA GCCCTTGCCA 
AATTTCCTGT TTGATAATTC CAACATGGGT 
AATGTGTGCA TTGTCTCTTC ACACTCTATT 
GTCTGCCTTG TAATGTTTGT TGAAAGCTGG 
AGAGCTGAAG TAGATGGGCC TTTAGTGTGA 
GTTTGGGGCA GGGTTTAAAG TCTGCTGTAG 
ACATTTGTTT TCATTTCCCG GGTTGCCCTT 
CCTCAGAGAG AGTCTGCGTC TTGTGGCCCT 
CACTGCGGAG GCCCTGTGGG TGTGTTGGGA 
TTCCACAGTC TGTGACCAAA TCTCAGTCTT 
AGTTGTTGAT CTGCTTTTCC CCTCCCCTTA 
GGTACAGGCT GAAAAACAGT CCTCCCCCAA 
GGAGAGCAAA TTCCATTTGC TGTGGAGAAT 
TGGTGACTGT CCCCATCCCA TGCCAGAGCC 
TTCATTAAGA CCTGGCAGGG TTCCTGGAGG 
GGGCCTCATA AGACCGCAGC CGCAGGAGTT 
TCCCCAGAGC TGCCCAGGAG GTGTTCCCGC 
GGCCAGCAGG TCTCAGCTGT GACTTTGCTC 
GGGTCGCCGC AGCGCTTTGA CCTCAGCTCT 
AATTGATTTT CATTAGTTCA GCATTTTTCC
GGCCTTCCAC GTTGTTTGCC GCAGAGCAGA 
CTCTGTGTTT CTTCGTCGGA GTCATGTCTT 
TGTGGTCAGA CAGGAGATTG GCAGGTATTT 
CTTTTCTCTT TCTTGAGGGC ATCATTTGCA 
18501 GCAGGGAAGT CTTGGACCTT
18551 GGCTGCTCTG CTTTGTCTAA 
18601 ATTTACTCCC ACACTTTCTT 
18651 TTAGCTGTGT GATTCATTTT 
18701 GGGTCCAGCT TCATTCTCTC 
18751 TTTGCTGAAG AGAGGATTCT 
18801 CTTGTTTTTT GACTTGCCGT 
18851 CGCCGGGACT GCACCATCTT 
18901 ATCAGGATCT GTGAATCCCC 
18951 TATTCTGGGT CCCTGGCATT 
19001 CATCTGAGCT GCCAAGCAGG 
19051 TCCATTTCCC AGGCCCCTCC 
19101 TGTGTCTCAG CATGTGTGAA 
19151 CTGGTCCAGT TACTCAGAAA 
19201 GGGATGGGAG ATGCCAGCAG 
19251 CCTGTGGGGA GTGGCTTCGG 
19301 TTCAAGAGGT CCCTGCCCTG 
19351 GTCAGTCATG GGCGAAGGGC 
19401 TGGCAAAGGG TTCCAGCCAC 
19451 GAGCAGGGCA GATCCAGGCC 
19501 GCGGCGCTAC ACACACCTTG 
19551 GTCACACATC TAGGTGACAC 
19601 CCTTCCAGAT GGTTCTGGAG 
19651 CCTAGGAAAA CATGGCTGTG 
19701 ACTGGGCTGC ACACAGGGCC 
19751 CGCCCACCCC AGCCCCTCCC 
19801 AGCGGCTCAG TCGGACCCTC 
19851 GTCACCTTCT CAGGCCGGCC 
19901 GGTTAGACCA GAGAGTGCTG 
19951 TCTTGATGAC TAAGGACAGG 
20001 GAGGCCACAG TGACAGGGCA 
20051 GAGAGGGCTG CCCGCAGAGA 
20101 CCTTGGCAAG GTGCTGGCCG 
20151 GGACCAGGGT GACACCACCA 
20201 GGCTAGGAAG AGAGAGTCCA 
20251 CCACCACAGA TCCAGGAGAG 
20301 CGTGAGCTGC TCCCCAAACC 
20351 CCTCGTGGGA AGAGGCGCAA 
20401 GCACGTCCGA GTGTGACGCA 
20451 GAACAGCCTT CTGCATCTCC 
20501 CAGAGTACTT GCCACTTGGC 
20551 GCATGGTGGC TCACGCCTGT 
20601 CGGAGCACCT GAGGTCAGGA 
20651 ACCCCGTCTC TACAAAAATT 
20701 CCAGCTACTT GGGAGGCTGA 
20751 GAGTTGGCAG TGAGCCAAGA 
20801 GAGCGAGACC AAAAATAGGA 
20851 GTTCGAGGCA CAGACGACAC 
20901 CTGGGGCCGT GAAAGCATGC 
20951 TGTCACGCAT TGCATAACGA 
21001 GGTGGTCCCG TAAGATGGTA 
21051 CTAGATAGGG TTGAGCGTTC 
21101 ACCTTCAGAG CCCAGCTCCC 
21151 TGACTTCCTA TGGATCTGAG 
21201 TGGCCTCAGC TGCAGGAGGT 
21251 CGACTCCCTG CACTTTTCAC 
21301 GCTCACGCGT TGGTAGTGAC 
21351 TAATGATGCT TTCCCCACAC 
21401 CCACCCCACC CACGCCCTCC 
21451 GCTGCCTGAA GGAAGTAGTC 
21501 GGTCCTGCCA GGGGCAGCCT 
21551 TCTAAGACAG CCCTGGTTCC 
21601 GCCTGGGACA GGTGTGTGTG 
21651 ATGAGACGGG AAAGGCCCGG 
21701 GAGAACAGCT TCTGCCTGGT 
21751 ATCCACCCAG AGGCAAGGGA 
21801 AGTCACCCGC AGAGATGCGA 
21851 CGGGGCACTT CAAATAGCCC 
21901 GCCCGATGGG GATGCCCAGA 
21951 GGACAGCGTC CCCCACGCTC 
22001 CTATCTTCAA GACTGCCTCG 
22051 ACAGAGAAAA GTGAGTGTGT 
22101 AGGAAAATGG GGCTAAGGTT 
22151 TGTGGGCCTG GGGTGCGTGG 



GACGTTGTCC GATGTGGCTG TTTCCTCCTT 
GGATCCATCA CCTCAGCTGA GGCCACAGGG 
CTAAGTCTTG TATAGTTTCA GCTCTTGCGT 
GGGTTAATTT TTATGAACAA TGTCAGGTGA 
ATCGGTGGAT ATCTGACTGT CCTAGCACCA 
TTCCCCATTG AATTGCTTTT GACATCATAC 
TTTATCCCAT TGGTCCAGAC GTCTGCCCTG 
GATAACTGTA GCTTTGTAGC AAGCTTTCAA 
CAGTTTTGTT CTTTTTGGAC ATTATTTGAC 
TCCACTGAAT GCTGAGGGGT CTGACAGTTG 
TTTGTGGCGG TGCTAGGGAC TGAAGCCTGC 
TCGCTGTGGG TGACATCTGG GGTCCGAGGC 
GGTGCCACGG GTGCCCTGAG ATGGGGATTC 
GTGCATCCAG GAGAGACCCC CGCCCTTCTC 
AGCTTGGCTT TCAAGCAGAA ATCTGGAAAC 
ACTTCAGGGG ACCTGGAGCG TCACTTGTGG 
AGGGAGCTGG GCTCTCAAAC ACCCACACCA 
CCCTCTGTGG CCTTCTGACT GTGTGTGTGC 
CCAAGGAGGA GGCAGGGGCT GTCAGAGGAA 
AGGCACAGAC CCTGCCATGG GGTACTGCTG 
GGAAGGGAGT CCCTGTGGAA AGGGGTCGTG 
AGCCCGGCTT GGGCGCTGCT CAGAGCCACC 
CAGCTCCTCA GGCTTCTGGT GGCCTCTCTG 
GACGTTGCAG GATGACCAAC AGCCCCTGCC 
ACGACGGGCG CTCATGTTCT ACATCACTGG 
ACCTTGTGTC GCTGTGAATC GCAGGATCCC 
ATTCCTGAGT AGTCTGAGCC TTAGGTCACC 
CGAGTTTGCA GACTTGTCTG TCTATATCAG 
AGACACAGCA GATCACCCAG CCTGTCCTCT 
TCCCCTGCCA GGATCGTGAC TCCTTTAGGG 
AAGCCTGGAG GGAGAGAGCC ACATGGAGAG 
GCGTGGGAGT CTGCCGGCTT CTTCTCGAGT 
CTCACACCGT GTACGTGTGG GGAACGCCCA 
GGAGGAGCGG GCGGGACAGT CCCCACTCAG 
CATTCCCGCT GCCAGGGTGA AACCCTCACA 
ACACGGAGGG CACTGCCTCG GGGTGGGGAA 
CAAGAAATGT GTTGAGCCCT ATGCTTCCTT 
GGTGAGACCG CTTGTCCTTT ATCTTGGCGG 
CGTCAGCAAA TCCCTGAACC GTTCATCGGA 
CACACTCTGT TCGTGGGTTT ACAGGGTGTC 
AATCAGCATT AATAGGATCC ACAGGGCCAG 
AATCTCAGCA CTTTGGGAGG CCGAAGCAGG 
GTTCAAGACT GGCGTGGCCA ACATGGTGAA 
AGCTGGGTAT GGTGGTGGGC ACCTGTAATC 
GGCAGCAGAA TCACTTGAAC CTGGGAGGCG 
TCACGCCACT GCACTCTAGT CTGGGTGACA 
TCCATGGATA GCAGGCAAGA GTGTCCAGGT 
TGTGACAGGG AAGAGTCCCC TTAGCCCTGG 
TGTTGTCCGT CTCGGGTGAA CGCAGACTGT 
TGCTTCCGTC ACTGGCCAAT CGCATGGGGG 
ACACTGGGTT TTGCTGTACG TTTTGTATGT 
TGGTGTGTAC CCACTCACAC GTCCCCTCCG 
TCCCTCCCAG GGCCTTGGCT GTGACGTGGG 
GTTCTGTGGT CCTCACAAGT GGGCATCCTC 
GGGGGCCCTT TTAATGCCAC CCGAGGCCTG 
TGTGACTTGG CTCATCTGGG TCTGTCATTT 
CAACGTCACC ATCCAAGTTC ACGGTCACCA 
CATGCCAGCG CTGAGCGGCC AGCACCCCCT 
CCCGACCCCT GCGTCCTGGG AAGTGGTCCT 
CTGCCTGCCC ATCACACACC AGTAAGGGTG 
CCGTCCACAA GCTTGCCCTG AGGACCTGCT 
AGGATTCTCT GGGCAGGGCC CCAGAAGCAG 
TGCTGTGATG AGGGGCTGGG AGAACCCGGT 
CAAGGGAGTG GTTTCCAGCA AAGTCCCGCA 
CCTGCAGGCC CCACGGAGCA AGTCCAAGCC 
GCTGGGCCTT GGCATCCTCG GGCTTGGGTG 
GCTCCCCGGG CAGTCTGGCT GCTGGAGGGC 
AGAGGCCGTC ATCCAAAGCC ACAGGTGGAG 
CACTCACTTG AGGGGACATG GGCGGAACCT 
ACGTGTGCCT TTCCATCCAC AGGAGAATGA 
ATGCCATCCA GCAAGACTTC GTGATTTTTA 
TGGGGGTGGG TGGGCGTGGT GGCAGAGAGG 
AAGGTTTTCT TGGCCACGTG AGCGGGCACC 
CCCTGCTCTC TTTGGGGACT CTGAGCAGCA 
22201 GCTATGGAGG GGAGCGGCGG
22251 GCTCCACGCG GGGCCCGTGG 
22301 TGCCCAGATG AGGGGTGGTG 
22351 CAACTCCCCC ACCCAGGCAC 
22401 ATGTCAGGCC CCAGGGGACA 
22451 GGCAGGCCAG AAATAGCAGC 
22501 GATGCTCCCT CTGCCCGCAC 
22551 GTCTGAGCTC TTATGGACAG 
22601 AGGGCACTGA CTGGCACCCT 
22651 CTTAGCCGAG CCTCATTCCC 
22701 AGGCCTTTCG GCTTCTGGCT 
22751 ACGTGCCCCC ACCCGTCCCC 
22801 CCTCTCCAGC AGACACCGAG 
22851 TGGCCTATCC CATGAGCCCG 
22901 AGGTGGGGGC ATAACGCCCT 
22951 GGAGCCAGGA CCTCCCGAGG 
23001 GCTGCGGAGC CTGTGGAGGA 
23051 CGGCCCCATT TGCCCCTCGG 
23101 TCCTCACCCC TTGAGCTGCT 
23151 TGGGCCGAGG CCTGGCATTC 
23201 CACAGTGCCC CGGACACATT 
23251 ACAAGAGGCT GTGGGTGCAG 
23301 CCCCCGAGAC CCGCCTAGAT 
23351 GCTCACGGGG AGCTGGGGCC 
23401 CAGCCCGGAG GCTGTGCACG 
23451 AGGCCGCGGG TGCCCCAGGC 
23501 GGTTGGGGTA GTGGGTGGGG 
23551 TCCAGCAATT TCTGTATAGT 
23601 CGCTCGATGT TGCATGTTCT 
23651 GCCGGTGCCT GCCGTTTCTG 
23701 TGGGCAGCTC CTCAGGGCAG 
23751 CCTGTCTCCG TGGCCCTGGA 
23801 CTGGAGGCAG AGGAGCCCAG 
23851 GTGCCGGGCA GTGCTTGGCC 
23901 GGCTGGGTGG TTCTGTGGTT 
23951 TGTGCTTCTC TCTGGTCTCG 
24001 CAGCGCTCCA GACAGGCCTG 
24051 CGGGTCTGTG CCCCTTCCCA 
24101 ACCTGTGTCC ATGAGCCCCA 
24151 CCTTGGCTGG GCTGCGAGAT 
24201 GAAGAACCGG CCATGCCACG 
24251 GCAGCTTCCC GTCCCCCACC 
24301 TAGCCTTCCA GCCCTCAGCA 
24351 CCACCAGGAA GGGTGGCCCG 
24401 CTCGGTGGAA GTCTGGCGAT 
24451 GTGGGGGTTT GAATAAGTGC 
24501 TGTCCTTCTT CGAGACACTG 
24551 ATCGCCCCCC ACACTGGGGC 
24601 AGCTCCCACA TCTGGGCCAG 
24651 CCAGGGCCTT CGCCTGGGAT 
24701 GAGATGGGCC TGTCTGTGGT 
24751 CACCATCGGT GCATGGGCAG 
24801 CTGGCTCTCT GCTCAACAAA 
24851 CCGCTCTTGG GATGAGGACA 
24901 GAGACACAGC ACGCGCCAGC 
24951 GCCTCATAGC TGCCCGGCCT 
25001 CCCACCTGGG TGGCGGTGGT 
25051 TGCTCCCCGG CCTTCTGGGG 
25101 GGGGGGGGGG GGGGGCCTCT 
25151 CCCGCCCTGG ATGTTGGCTG 
25201 GCGCTCCTCC CTGCCATTGA 
25251 CGGACCAGCC CACAGCCAGC 
25301 GGAGAGGAGA CGGGGCCTTC 
25351 CGTCTTTAAT GGGCCTGATT 
25401 GGCGGTGGGT GGCATGCACA 
25451 AAGCTGCTAC TAAGAAGCCT 
25501 AGTGTCTTCC TAAAGGATCG 
25551 AGCTTAGGTG TGGGCCTTGG 
25601 GCT 
(SEQ ID NO: 3) 

FEATURES: 
Start: 3000 
Exon: 3000-3118 



GAGGCCCCTG CCAGGCTCTG GCATGTTTGT 
CTGGAATCTT CTGGGGAGAG ACACATCATT 
ACTTCCTAGG AGGCCCCATC AGAGCCACGT 
GCCCTCAGTC TCTCAGCAGA CCTTTCCTGA 
GAAAGGGCAC AGATGACTGG CAGCAGGCAA 
AGCTGCCACG GTGGGGCCCA AGGGAGGATG 
GGGGCAAGGA GGGCCTCCTG GAGGAGGTGG 
GACGTGCAGG GCAGCACGTG CAGACGGCTG 
GGGGATCAGA CGACCGGGTG AAGAATGAGG 
AAGTCACTGA CCTATGGCAC CTGCACAGTC 
GGAAACATGC CGAGCCTCGC CAGCATGCTC 
AGGCTCCCTG CCAGTGTGTC GGGAGCATGG 
CCTGTGGCCC ACGTTTGGGC ATCCACGCCA 
TGGGCAGGTC ATGGGACCGT GAGGCCAGGG 
CCATGTGTTC CTGCCACCCC AGGCTGAAGA 
GAGCCTCTCC CCGCCCCTGA GTCCAGGGAT 
CGAGGCGGAA CGCTCCGCCC TGCCCATGTG 
CCGGGAGCGG CTAGGCCGGG ACGCCCGTGG 
TTGGAGACTC GGCTGCCAGA GGGAGGGCCA 
ACGTTCCCAC CCAGCCTGGC TGGCGGTGCC 
TCACACCTCA GGCTCGTGGT GGTGCAGGGG 
GGGACACCTG TGGAGGGCAT TTCCCGTGGG 
GGAGGAAGCG CTGCTGGGCG CCCTCTTACC 
ATGGATGGGA CAGGAGTCTT TGTCCCTGCT 
GCCCTCGTCA CAAGGTGACC CTTGCAGCAC 
TCGGCTCAGT TCTTGGAGGT CAAGGGCATG 
AGGTGAATGT TTTCTAGAGA TTCAAACTGC 
TTTCACCTCT GAGAATTACA ATGTGAGAAC 
GCGTACGTCC TGTGTCTGCC TGGCCGTCAG 
GTTGGCCTGG ACTTGGGGCA GCCAGTGGGG 
AGCTCCCGGA CCATGGCTTT GGGGTGGGTG 
GCCGTAAGGC TGTGGAAGGC AGAGACGGTC 
GACAGCACCG TGCACCGTGG AGCCGCCGCA 
CTCCATAAAG GGACGTATCC CTCTCACTGT 
GGAACTGTAA CTAACTGGGT AAACGGCCTG 
CTGGAGGAGG ACGGGCTCAG CCCGTCAGCC 
TGCTGGTTTC CTCTGAGGAA ATGGGTGTGG 
GGACAGCGGC CATAGTGGAC ATGTGCCTAG 
CTGCACCCCT GGCAAACAGG GCCCTCCCGT 
GGAGATGACA ACGGCCAAAG AACATTTGGG 
AGCAGAGTCA GAAGTCCGAG GGGATAGAAT 
CACCCCTGTC CTCTGAATCA TGGCAGAAAC 
GCTCACATGG GGGACACGGC ACCCAAATCA 
GTCTCTGCGA GGGCCCAGAG GCGCCGTGTA 
GTCAGAGACA GGCTCGGGGC AAGGACAGGT 
ATTTGGGGAA CATGGCAGGG TGGTGCACCT 
GTGAGGTGTG GGTGCTGTCT GGTTCCCTTG 
AGAGTGGGAG ATGCTGGTGT GGGGGACATC 
AGGGAGCCCC GGGAAGGAAA TGCTGAGGGC 
CTGCACAGCT TTATAAGCAG CCCAGGGTGA 
TCCCAGAGAC CACGGCAGGA AATTCTCTGT 
GGGCCAGAGA ACCGGTGGCA CAAGGTGTCC 
CAGCGAGTGC CCAGTGACTG CGACCAGGCC 
ACCGTCTGGG AACGTCCACG CACCCTTTAT 
ACCGCAGTCA CACACCGGGG GCTCGGGTCA 
TGAGTGCTGG GCCTGCGTCT GTGAGCAGCG 
GTGTGCTTCA CTTCCACACA CTCCCGTGCA 
TGGCTCGGGC TGTCCGGTGA GAATGTAGGC 
GTCCCGCCCT GGATGTTGGC TGCCCTCTGC 
CCCTGCCGGC CATCTTCCCT GTGAGAGGGT 
GGGGAGAAGA GCTGCGGCTG CAGGAGTCGT 
CAGGCCCCGC CTGTGCAGAG ACGGCGTGGG 
CTTCCATGCA CAGGCGGCTT CAAACCCAGA 
CACATCAGAG GCAGGATGAC TGCCTGTCCA 
GGTTCCTGGC TACAGTGTCC TCAGTGTACA 
ACGGAAATAC ACAATCTGTA ATAAGAGGAC 
CAAAACTTCC CTGGATGAGG GCTACATGGA 
GGTGCGTAAA AGGGACCCTC CACGGGCGGG 
Intron: 3119-4126
Exon: 4127-4300
Intron: 4301-5310
Exon: 5311-5348
Intron: 5349-5478
Exon: 5479-5568
Intron: 5569-5758
Exon: 5759-5862
Intron: 5863-6571
Exon: 6572-6688
Intron: 6689-8108
Exon: 8109-8234
Intron: 8235-8322
Exon: 8323-8454
Intron: 8455-21992
Exon: 21993-22060
Intron: 22061-22942
Exon: 22943-23081
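
The coordinate ranges above can be used to check the domain given for each SNP position in the SNP listing. The following is a minimal sketch, not part of the original listing. It assumes that the ranges strictly alternate between exon and intron, starting with the exon at 3000-3118, and that positions outside 3000-23081 count as beyond the ORF. The names `RANGES` and `classify` are hypothetical.

```python
# Illustrative sketch only; RANGES and classify are hypothetical names.
# Assumption: ranges alternate exon, intron, exon, ... starting with
# the exon at 3000-3118 given in the FEATURES block.
RANGES = [
    (3000, 3118), (3119, 4126), (4127, 4300), (4301, 5310), (5311, 5348),
    (5349, 5478), (5479, 5568), (5569, 5758), (5759, 5862), (5863, 6571),
    (6572, 6688), (6689, 8108), (8109, 8234), (8235, 8322), (8323, 8454),
    (8455, 21992), (21993, 22060), (22061, 22942), (22943, 23081),
]


def classify(position):
    """Return the domain label for a 1-based genomic position."""
    # Assumption: anything before the first exon or after the last exon
    # is treated as lying beyond the ORF.
    if position < RANGES[0][0]:
        return "Beyond ORF(5')"
    if position > RANGES[-1][1]:
        return "Beyond ORF(3')"
    for index, (start, end) in enumerate(RANGES):
        if start <= position <= end:
            # Even indices are exons, odd indices are introns.
            return "Exon" if index % 2 == 0 else "Intron"
    raise ValueError(f"position {position} falls in a gap between ranges")


if __name__ == "__main__":
    for pos in (4232, 4309, 5787, 5884, 23406):
        print(pos, classify(pos))
```

Under these assumptions, the labels returned for positions such as 4232, 4309, 5787 and 5884 can be compared with the domains recorded in the SNP listing.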
SNPs:

DNA
Position   Major   Minor   Domain
           G       A       Beyond ORF(5')
4232       G               Exon
4309       G               Intron
4622
4708                       Intron
5787                       Exon
5884               T       Intron
8481       C       T       Intron
8754       G       A       Intron
8847       G       A       Intron
11114      G       A       Intron
19741      A       G       Intron
20908      C       T       Intron
22728      T       C       Intron
23406      C       T       Beyond ORF(3')
24078      G       A       Beyond ORF(3')
24777      G       A       Beyond ORF(3')

Protein 

Position Major Minor
75 T T 



150 



Context: 

DNA 

Position

1931 AGTTCAGTGGTACAGTCGTGGCTCACTGCAGCCTCAACCTCTCAGGCTCAAGCAATCCTC
CCTTCTCAGTCTCCTGAGTAGCTGGGACTACGGGTGTGTGCCACCACGCTTTCCGGGCAT
GCAGCCAGGAGCCCAGGGCCATCTGTGGCCCACCTTGAGATCCAGAATCATCCATTTCCT
CCAGGCCCCCTGCTGGGCTCCAACTCCTTGAGGACCAGAGAGCAGAGGTTGTGGAAGGCC
TTGGAAACGGGTCTGGATTACCTGTCCTGGGAAGGTCTCTCCCAACCTGAGTGTCAGACA
[G,A]

GGGTTAGCTCTGCTGCTCACAATTTTGTGCCTTAATTCCTGGCTTCCCTTTGGGGATCTT
CATCCTCAATTCTGATTGACATCCTTGGCCACAAGGGACCCCCCTGCTCATTGATGCTTC
TC^CCCGTCACCTCACTCTCATCCTCACTGCTAAGCAATTAGCCGTGTGTTTGCGGCATC
AGTGTTGACACCGATGATCCATGCTCAGAGGGTACAGGCCTGAAGAGCTATGTGGGGACT
GGCGCCCCGGAGGGGGTCCCGCTGTGGTGGCAGCGGTGGCCCCCAAGCCCCACGCTCACT

4232 CCTCAGGATCTGGGCTCAGAGCTGCTCTGGGCTGGGGTGCAATCAGTGCCTTGGGCAGGC
CCCTCCTCCTGGGAATGCCTGGTGGCTGATGCTGGGGTGGGGCTGTGGTCCTTAGGGGGA 
GTGTGTCAGCTGTGGGAGCAGCCATGACTGGCTCCCCAGCTGTGCGCACAACAGGCCTTC 
CATCGGTGCCCACAGGTACTCCTTCCAGGACGAGGAGGACATGTTCATGGTCGTGGACCT 
GCTACTGGGCGGGGACCTGCGCTACCACCTGCAGCAGAACGTGCAGTTCTCCGAGGACAC 
[G,A] 

GTGAGGCTGTACATCTGCGAGATGGCACTGGCTCTGGACTACCTGCGCGGCCAGCACATC 
ATCCACAGGTGTGTGCGTGGCAGACGGCGCAGGTACCTGCTGAGGTGGGCGGGGCTGAAG 
CAGCCTTAGGTCAGGCTGCCGGCACGGCGGCCGTACTCCCTCAGAGCGGGTCTAGCTCCT
CTGCCCCACCCTTGCCTGAGTGCCTGCCCCCAGCTGTGGCACCTGTGCCGACCAGGTCAG 
CCCCATAGCTGTGTGCCTGGTGTCCATCTGGGGGGACCTCGTCCCAGCAGCCCCAGCTGA 

4309 CCTGGTGGCTGATGCTGGGGTGGGGCTGTGGTCCTTAGGGGGAGTGTGTCAGCTGTGGGA
GCAGCCATGACTGGCTCCCCAGCTGTGCGCACAACAGGCCTTCCATCGGTGCCCACAGGT 
ACTCCTTCCAGGACGAGGAGGACATGTTCATGGTCGTGGACCTGCTACTGGGCGGGGACC 
TGCGCTACCACCTGCAGCAGAACGTGCAGTTCTCCGAGGACACGGTGAGGCTGTACATCT 
GCGAGATGGCACTGGCTCTGGACTACCTGCGCGGCCAGCACATCATCCACAGGTGTGTGC 
[G,T] 

TGGCAGACGGCGCAGGTACCTGCTGAGGTGGGCGGGGCTGAAGCAGCCTTAGGTCAGGCT 
GCCGGCACGGCGGCCGTACTCCCTCAGAGCGGGTCTAGCTCCTCTGCCCCACCCTTGCCT 
GAGTGCCTGCCCCCAGCTGTGGCACCTGTGCCGACCAGGTCAGCCCCATAGCTGTGTGCC 
TGGTGTCCATCTGGGGGGACCTCGTCCCAGCAGCCCCAGCTGAGACTGGGCACAGTGGGC 
TGTTAGCCCTGGTGGACAGACCACCAGGCTGGGTCACAGCAGGTGGCCTTCACCTGGTCC 

4622 CAGGTACCTGCTGAGGTGGGCGGGGCTGAAGCAGCCTTAGGTCAGGCTGCCGGCACGGCG 
GCCGTACTCCCTCAGAGCGGGTCTAGCTCCTCTGCCCCACCCTTGCCTGAGTGCCTGCCC 
CCAGCTGTGGCACCTGTGCCGACCAGGTCAGCCCCATAGCTGTGTGCCTGGTGTCCATCT 
GGGGGGACCTCGTCCCAGCAGCCCCAGCTGAGACTGGGCACAGTGGGCTGTTAGCCCTGG 
TGGACAGACCACCAGGCTGGGTCACAGCAGGTGGCCTTCACCTGGTCCATTTAACTGAAG 
[G,A] 

CTCCTGTTTGCCCATCCACCACATCCCAGGGAATCCAAACTAATTTTAACATTAGCTTAA 
AGCAGATGAAATTAGGAAGCAGAGCTGGTGTGATGGCTCTGAAAATAAAATTTAAAAAAA 
GAAAATAGGAAGCAGATTATGAAGGAAGTGAAATTGGGAAGCAGAAATTAGGCTGAAATT 
CCGCAGCAATGGAACAAAATGAAAATATCTGTGAGGTATATTTTAAAGTCGAATGGACTG 
GTGTTTGCATTTCTGCTCTTGGGGACTCGGATGTCTGATTATGACCTAGGCACCAGTCAC 

4708 CTCCTCTGCCCCACCCTTGCCTGAGTGCCTGCCCCCAGCTGTGGCACCTGTGCCGACCAG 
GTCAGCCCCATAGCTGTGTGCCTGGTGTCCATCTGGGGGGACCTCGTCCCAGCAGCCCCA 
GCTGAGACTGGGCACAGTGGGCTGTTAGCCCTGGTGGACAGACCACCAGGCTGGGTCACA 
GCAGGTGGCCTTCACCTGGTCCATTTAACTGAAGACTCCTGTTTGCCCATCCACCACATC 
CCAGGGAATCCAAACTAATTTTAACATTAGCTTAAAGCAGATGAAATTAGGAAGCAGAGC 
[T,C] 

GGTGTGATGGCTCTGAAAATAAAATTTAAAAAAAGAAAATAGGAAGCAGATTATGAAGGA 
AGTGAAATTGGGAAGCAGAAATTAGGCTGAAATTCCGCAGCAATGGAACAAAATGAAAAT 
ATCTGTGAGGTATATTTTAAAGTCGAATGGACTGGTGTTTGCATTTCTGCTCTTGGGGAC 
TCGGATGTCTGATTATGACCTAGGCACCAGTCACTGAGCACTGGCTGTGTACCTGGAAAA 
GTTGGGACAAAGCAAGAGCCGAGGTGGCTTGGTCTCCTAGAGGCCGAGTCTTGGAGGGGG 

5787 CACCTGACCGACTTCAACATTGCCACCATCATCAAGGACGGGGAGCGGGCGACGGCATTA 
GCAGGCACCAAGCCGTACATGGGTGAGCCCGAGCTGGGGTTCCAGATGGGAGCTGGCTTC 
CTCCAGGTGGGAAGGACAAGACCTCGGTGGCTTCTCTGTCCCACCCTGGAGGCAGCCTGG 
TCTCGGGATGTGGCCTCAAGGTGCCGGCCCTGTGCCCACGGGTCCGGGCTGTGACCCCGT 
GGCAGCTGTTTTTCCTTCTTTCTGTCGGAAAGCTCCGGAGATCTTCCACTCTTTTGTCAA
[C,T] 

GGCGGGACCGGCTACTCCTTCGAGGTGGACTGGTGGTCGGTGGGGGTGATGGCCTATGAG 
CTGCTGCGAGGATGGGTATGGACCCCCTGCAGCCCCCGGGCTTGGCTGCCAGGCCCCTGC 
TCTCTGCCCCCACCAGTGCTGGGGAGGGGGTGGCTGCCCCAGTGCCCAGGTGCGCAGGGA 
TGTCTCCACTGTGTCTGAGGAGTCACGCTTTTATCGAAGTGTGTAGTTGGTGATGGAATG
CCTGAGCAGGAGGAGGAAGGACAGACTCACTGTGGTTTCCCGGGGCCGCTGCTGGTGCCT 

5884 GGTTCCAGATGGGAGCTGGCTTCCTCCAGGTGGGAAGGACAAGACCTCGGTGGCTTCTCT
GTCCCACCCTGGAGGCAGCCTGGTCTCGGGATGTGGCCTCAAGGTGCCGGCCCTGTGCCC 
ACGGGTCCGGGCTGTGACCCCGTGGCAGCTGTTTTTCCTTCTTTCTGTCGGAAAGCTCCG
GAGATCTTCCACTCTTTTGTCAACGGCGGGACCGGCTACTCCTTCGAGGTGGACTGGTGG
TCGGTGGGGGTGATGGCCTATGAGCTGCTGCGAGGATGGGTATGGACCCCCTGCAGCCCC 
[C,T]

GGGCTTGGCTGCCAGGCCCCTGCTCTCTGCCCCCACCAGTGCTGGGGAGGGGGTGGCTGC 
CCCAGTGCCCAGGTGCGCAGGGATGTCTCCACTGTGTCTGAGGAGTCACGCTTTTATCGA
AGTGTGTAGTTGGTGATGGAATGCCTGAGCAGGAGGAGGAAGGACAGACTCACTGTGGTT 
TCCCGGGGCCGCTGCTGGTGCCTGCAGGCCAGCCTCTGTGGGGGTGGACAAGGCTGAGAA 
CTGGCCAGCAGGGGTGCTGCCTCGGAACTTTCCACAAAAAGTTTCTTTTGGGGCCCTGTG 

8481 GTGCTGTGGGACCACCTGAGCGAGAAGAGGGTGGAGCCGGGCTTCGTGCCCAACGTAAGC 
CTGTGGGCGGCTCAGGTGGGGGGCCCTGGGGATGGATGTGGCGTCCTCCACGGGCCGGGG 
CTCAGCACCCATCCCTCTGTAGAAAGGCCGTCTGCACTGCGACCCCACCTTTGAGCTGGA 
GGAGATGATCCTGGAGTCCAGGCCCCTGCACAAGAAGAAGAAGCGCCTGGCCAAGAACAA 
GTCCCGGGACAACAGCAGGGACAGCTCCCAGTCCGTGAGTGCCAGGGCAGGCTCAGGGCG 
[C,T] 

GGCGGCGGGCTGGGCTTGGGGCTCCTCTCTACCACCGAGCAAGGTGTGTGGGGACCCCTG 
ACAGTGCACACGTCTCGGAAGTCCAGCAGACCGTTTCCTGAAGTCCTGAGAAGGCCAGAG 
ACCTCCCTTCTGCCTTTCCCAGCCCCCACCTCGCTCCTTATGAAGCAGGTGGGCAGGGAC 
AACCAGGGCTGGGGTTATGAGTGCACGGGGATGGCCATGTGAAGCCTTCGTGCTTGCCCA 
GGTGTGCTGGTGTTGGTTGTGTGTGCGGGGACGGCTATGTGAAGCCCTCACACTCGCCCA 

8754 CGTGAGTGCCAGGGCAGGCTCAGGGCGCGGCGGCGGGCTGGGCTTGGGGCTCCTCTCTAC 
CACCGAGCAAGGTGTGTGGGGACCCCTGACAGTGCACACGTCTCGGAAGTCCAGCAGACC 
GTTTCCTGAAGTCCTGAGAAGGCCAGAGACCTCCCTTCTGCCTTTCCCAGCCCCCACCTC
GCTCCTTATGAAGCAGGTGGGCAGGGACAACCAGGGCTGGGGTTATGAGTGCACGGGGAT 
GGCCATGTGAAGCCTTCGTGCTTGCCCAGGTGTGCTGGTGTTGGTTGTGTGTGCGGGGAC 
[G,A] 

GCTATGTGAAGCCCTCACACTCGCCCAGGTGCGTCGGCATCAGGTATGTGTGCCGGGACA 
GCCATGTGAAGCCCTCACACTCACCCAGGTGCGTCGGCATCAGTTGTGTGTGTGGGGACG 
GCCATGTGAAGCCCTCACACTCGCCCAGGTGTGCTGGCTTTGGTTGTGTGTGCAGGGATG 
GCCACATGAAGCCCTCACACTCGCCCAGGTGCGTCAGCATCAGGTGTGTGTGCGGGGACG 
GCCATGTGAAGCCCTCACACTCGCCCAGGTGCGTTGATGTTGTGTGTGCAGGGATGGCCA 

8847 GCACACGTCTCGGAAGTCCAGCAGACCGTTTCCTGAAGTCCTGAGAAGGCCAGAGACCTC 
CCTTCTGCCTTTCCCAGCCCCCACCTCGCTCCTTATGAAGCAGGTGGGCAGGGACAACCA 
GGGCTGGGGTTATGAGTGCACGGGGATGGCCATGTGAAGCCTTCGTGCTTGCCCAGGTGT 
GCTGGTGTTGGTTGTGTGTGCGGGGACGGCTATGTGAAGCCCTCACACTCGCCCAGGTGC 
GTCGGCATCAGGTATGTGTGCCGGGACAGCCATGTGAAGCCCTCACACTCACCCAGGTGC 
[G,A] 

TCGGCATCAGTTGTGTGTGTGGGGACGGCCATGTGAAGCCCTCACACTCGCCCAGGTGTG 
CTGGCTTTGGTTGTGTGTGCAGGGATGGCCACATGAAGCCCTCACACTCGCCCAGGTGCG 
TCAGCATCAGGTGTGTGTGCGGGGACGGCCATGTGAAGCCCTCACACTCGCCCAGGTGCG 
TTGATGTTGTGTGTGCAGGGATGGCCATGTGAAGCCCTCACACTCACCCAGGTGCGTTGA 
TGTCAGTTGTGTGTGCAGGGACAGCCATGTGAAGCCCTCAGACTAGCCCAGGTGTGTCGG 

11114 AGACGCATCCCCCCAGCCCCTGCTTCGCTGCCGCGTGGGGTCGGCCTGTGTGTGCGTCTC 
CATCTCCTCCCCTCTTCTCATAAGGACACCAGGCATTGGATTTAGGGCCCACCCTGATCC 
AGTATGGCCCCATCTTATCTTGATGATATCTGCAAAGACCTCACTTCCAAATGAGGTCAC 
ATTCACAGGTACCCAGGATTAGAATTTGAGTGTGTCATTTTTGGGGACACAGTTTGGCCC
ATACCACCAGGATGTGGCTGATATTCACCAAGGAGTAGCTATGGTTGTGTGTTGATGTCA 
[G,A]

GGTGACGGTGATGACCCTGGGTCCCTCGGTGGTCCCCTTGCCCCTGAGTCTGCCTGAGCC 
TGTGGTGGATGTCCTGGGAAACTCTTGTGCCTCAGCCCCCGTGCAGCCTCCTCAGACCTG 
GTGGGCCCTGTGTTGCTCCTGGGCAGAAGACGGGTGTCAGTCCCCTCCTCACCATGATGT 
GGGGGGCAGGGGTGGGGTCATGCCCTGGGTGCCCTGATTTTGGGGGGAACACGGCCCCCC 
AGTGGGTCAGGCTCCCATCCTCGCCCCTCCTCCAGGACGGCTGCCGGCAGCCCTGGGTGT 

19741 GTCAGAGGAAGAGCAGGGCAGATCCAGGCCAGGCACAGACCCTGCCATGGGGTACTGCTG 
GCGGCGCTACACACACCTTGGGAAGGGAGTCCCTGTGGAAAGGGGTCGTGGTCACACATC 
TAGGTGACACAGCCCGGCTTGGGCGCTGCTCAGAGCCACCCCTTCCAGATGGTTCTGGAG 
CAGCTCCTCAGGCTTCTGGTGGCCTCTCTGCCTAGGAAAACATGGCTGTGGACGTTGCAG 
GATGACCAACAGCCCCTGCCACTGGGCTGCACACAGGGCCACGACGGGCGCTCATGTTCT 
[A,G] 

CATCACTGGCGCCCACCCCAGCCCCTCCCACCTTGTGTCGCTGTGAATCGCAGGATCCCA 
GCGGCTCAGTCGGACCCTCATTCCTGAGTAGTCTGAGCCTTAGGTCACCGTCACCTTCTC 
AGGCCGGCCCGAGTTTGCAGACTTGTCTGTCTATATCAGGGTTAGACCAGAGAGTGCTGA 
GACACAGCAGATCACCCAGCCTGTCCTCTTCTTGATGACTAAGGACAGGTCCCCTGCCAG
GATCGTGACTCCTTTAGGGGAGGCCACAGTGACAGGGCAAAGCCTGGAGGGAGAGAGCCA 

20908 CCTGAGGTCAGGAGTTCAAGACTGGCGTGGCCAACATGGTGAAACCCCGTCTCTACAAAA 
ATTAGCTGGGTATGGTGGTGGGCACCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGCA 
GAATCACTTGAACCTGGGAGGCGGAGTTGGCAGTGAGCCAAGATCACGCCACTGCACTCT 
AGTCTGGGTGACAGAGCGAGACCAAAAATAGGATCCATGGATAGCAGGCAAGAGTGTCCA 
GGTGTTCGAGGCACAGACGACACTGTGACAGGGAAGAGTCCCCTTAGCCCTGGCTGGGGC 
[C,T] 

GTGAAAGCATGCTGTTGTCCGTCTCGGGTGAACGCAGACTGTTGTCACGCATTGCATAAC 
GATGCTTCCGTCACTGGCCAATCGCATGGGGGGGTGGTCCCGTAAGATGGTAACACTGGG 
TTTTGCTGTACGTTTTGTATGTCTAGATAGGGTTGAGCGTTCTGGTGTGTACCCACTCAC 
ACGTCCCCTCCGACCTTCAGAGCCCAGCTCCCTCCCTCCCAGGGCCTTGGCTGTGACGTG 
GGTGACTTCCTATGGATCTGAGGTTCTGTGGTCCTCACAAGTGGGCATCCTCTGGCCTCA 

22728 CACAGATGACTGGCAGCAGGCAAGGCAGGCCAGAAATAGCAGCAGCTGCCACGGTGGGGC 
CCAAGGGAGGATGGATGCTCCCTCTGCCCGCACGGGGCAAGGAGGGCCTCCTGGAGGAGG 
TGGGTCTGAGCTCTTATGGACAGGACGTGCAGGGCAGCACGTGCAGACGGCTGAGGGCAC 
TGACTGGCACCCTGGGGATCAGACGACCGGGTGAAGAATGAGGCTTAGCCGAGCCTCATT 
CCCAAGTCACTGACCTATGGCACCTGCACAGTCAGGCCTTTCGGCTTCTGGCTGGAAACA 
[T,C]

GCCGAGCCTCGCCAGCATGCTCACGTGCCCCCACCCGTCCCCAGGCTCCCTGCCAGTGTG 
TCGGGAGCATGGCCTCTCCAGCAGACACCGAGCCTGTGGCCCACGTTTGGGCATCCACGC 
CATGGCCTATCCCATGAGCCCGTGGGCAGGTCATGGGACCGTGAGGCCAGGGAGGTGGGG 
GCATAACGCCCTCCATGTGTTCCTGCCACCCCAGGCTGAAGAGGAGCCAGGACCTCCCGA 
GGGAGCCTCTCCCCGCCCCTGAGTCCAGGGATGCTGCGGAGCCTGTGGAGGACGAGGCGG 

23406 ACCCCTTGAGCTGCTTTGGAGACTCGGCTGCCAGAGGGAGGGCCATGGGCCGAGGCCTGG 
CATTCACGTTCCCACCCAGCCTGGCTGGCGGTGCCCACAGTGCCCCGGACACATTTCACA 
CCTCAGGCTCGTGGTGGTGCAGGGGACAAGAGGCTGTGGGTGCAGGGGACACCTGTGGAG 
GGCATTTCCCGTGGGCCCCCGAGACCCGCCTAGATGGAGGAAGCGCTGCTGGGCGCCCTC 
TTACCGCTCACGGGGAGCTGGGGCCATGGATGGGACAGGAGTCTTTGTCCCTGCTCAGCC
[C,T] 

GGAGGCTGTGCACGGCCCTCGTCACAAGGTGACCCTTGCAGCACAGGCCGCGGGTGCCCC 
AGGCTCGGCTCAGTTCTTGGAGGTCAAGGGCATGGGTTGGGGTAGTGGGTGGGGAGGTGA 
ATGTTTTCTAGAGATTCAAACTGCTCCAGCAATTTCTGTATAGTTTTCACCTCTGAGAAT 
TACAATGTGAGAACCGCTCGATGTTGCATGTTCTGCGTACGTCCTGTGTCTGCCTGGCCG 
TCAGGCCGGTGCCTGCCGTTTCTGGTTGGCCTGGACTTGGGGCAGCCAGTGGGGTGGGCA 

24078 GGCTGTGGAAGGCAGAGACGGTCCTGGAGGCAGAGGAGCCCAGGACAGCACCGTGCACCG 
TGGAGCCGCCGCAGTGCCGGGCAGTGCTTGGCCCTCCATAAAGGGACGTATCCCTCTCAC 
TGTGGCTGGGTGGTTCTGTGGTTGGAACTGTAACTAACTGGGTAAACGGCCTGTGTGCTT 
CTCTCTGGTCTCGCTGGAGGAGGACGGGCTCAGCCCGTCAGCCCAGCGCTCCAGACAGGC 
CTGTGCTGGTTTCCTCTGAGGAAATGGGTGTGGCGGGTCTGTGCCCCTTCCCAGGACAGC 
[G,A] 

GCCATAGTGGACATGTGCCTAGACCTGTGTCCATGAGCCCCACTGCACCCCTGGCAAACA 
GGGCCCTCCCGTCCTTGGCTGGGCTGCGAGATGGAGATGACAACGGCCAAAGAACATTTG 
GGGAAGAACCGGCCATGCCACGAGCAGAGTCAGAAGTCCGAGGGGATAGAATGCAGCTTC 
CCGTCCCCCACCCACCCCTGTCCTCTGAATCATGGCAGAAACTAGCCTTCCAGCCCTCAG 
CAGCTCACATGGGGGACACGGCACCCAAATCACCACCAGGAAGGGTGGCCCGGTCTCTGC 

24777 GGAACATGGCAGGGTGGTGCACCTTGTCCTTCTTCGAGACACTGGTGAGGTGTGGGTGCT 
GTCTGGTTCCCTTGATCGCCCCCCACACTGGGGCAGAGTGGGAGATGCTGGTGTGGGGGA 
CATCAGCTCCCACATCTGGGCCAGAGGGAGCCCCGGGAAGGAAATGCTGAGGGCCCAGGG 
CCTTCGCCTGGGATCTGCACAGCTTTATAAGCAGCCCAGGGTGAGAGATGGGCCTGTCTG 
TGGTTCCCAGAGACCACGGCAGGAAATTCTCTGTCACCATCGGTGCATGGGCAGGGGCCA 
[G,A] 

AGAACCGGTGGCACAAGGTGTCCCTGGCTCTCTGCTCAACAAACAGCGAGTGCCCAGTGA 
CTGCGACCAGGCCCCGCTCTTGGGATGAGGACAACCGTCTGGGAACGTCCACGCACCCTT 
TATGAGACACAGCACGCGCCAGCACCGCAGTCACACACCGGGGGCTCGGGTCAGCCTCAT 
AGCTGCCCGGCCTTGAGTGCTGGGCCTGCGTCTGTGAGCAGCGCCCACCTGGGTGGCGGT 
GGTGTGTGCTTCACTTCCACACACTCCCGTGCATGCTCCCCGGCCTTCTGGGGTGGCTCG 
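
Each context entry above gives a 5' flank, the bracketed allele pair, and a 3' flank. The sketch below is illustrative only and not part of the original listing. It shows one way to build the allele-specific sequences from such an entry, using the last 5' flank line and the first 3' flank line of the 4309 entry as sample input. The function name `allele_sequences` and the variable names are hypothetical.

```python
# Illustrative sketch only; allele_sequences is a hypothetical helper.
def allele_sequences(five_prime, alleles, three_prime):
    """Map each allele in a '[X,Y]' field to flank + allele + flank."""
    # Strip the brackets, then split the pair on the comma.
    bases = [b.strip().upper() for b in alleles.strip("[] \n").split(",")]
    # Remove any whitespace or line breaks inside the flanks.
    five = "".join(five_prime.split())
    three = "".join(three_prime.split())
    return {base: five + base + three for base in bases}


# Sample input: one flank line from each side of the 4309 entry.
FIVE = "GCGAGATGGCACTGGCTCTGGACTACCTGCGCGGCCAGCACATCATCCACAGGTGTGTGC"
THREE = "TGGCAGACGGCGCAGGTACCTGCTGAGGTGGGCGGGGCTGAAGCAGCCTTAGGTCAGGCT"

variants = allele_sequences(FIVE, "[G,T]", THREE)

if __name__ == "__main__":
    for base, seq in variants.items():
        print(base, len(seq))
```

Passing the full five-line flanks instead of single lines works the same way, because whitespace and line breaks inside the flanks are removed before joining.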